The following deliverables and reports summarise the work performed by Solve-RD partners.
D1.3 Training modules, guidance document and online help module for collection of phenotypes
In order to engage users with Solve-RD and facilitate their contribution of phenotypic data to the project, we have developed training materials and have carried out training activities, which will continue in time. We have set up a practical guide to PhenoTips data entry, provided an in-depth YouTube video with instructions, organized our first training webinar for ERNs (available here) and created an option for uploading phenotype data using an Excel template. In addition, further support to researchers is provided by one-to-one tele-conferences for PhenoTips usage and by answering any queries they have through email@example.com.
D1.4 Deployment of PhenoTips custom forms according to the ERNs specifications
This deliverable report describes the adaptations and customisations CNAG-CRG have carried out in PhenoTips to allow for the collation of phenotype data tailored to each ERN and disease group. Users can enter data individually for each proband and family member through one of the customized templates, import information using JSON schemas or provide a filled-in Excel template for bulk upload.
D1.5 Guidelines for collection of experimental data
This deliverable report provides information on the guidelines that have been developed for experimental data collection for Solve-RD. These guidelines aim to aid and facilitate all data upload from all Solve-RD collaborators and ensure high quality standards are met.
D1.10 Adaptation of BOQA algorithm to its use in the ontology of unsolved RD
WP1 not only aims to collect standardized phenotypic information from unsolved rare disease (RD) cases, but also aims to transform their phenotypic descriptions into diagnostic hypotheses. One of the proposed means to produce these diagnostic hypotheses is calculating a numerical similarity value that reflects how well the phenotype information of an unsolved case aligns with other solved or unsolved cases as well as how well it overlaps with known rare diseases.
In this report we present a software tool we developed and that adapts several algorithms to calculate the degree of similarity between unsolved cases and known diseases using solved cases. The results of this tool will be imported directly into the ontology of rare unsolved cases (RDCO).
We have tested the tool and implemented algorithms on 107 PhenoPackets obtained from solved cases. We identified two algorithms that show superior performance on this relatively small test set and were able to run this tool on Orphanet computers.
D2.3 Guidelines for Quality Control metrics
Solve-RD has collated over 8,400 standardised phenotypic and genomic datasets from partners across different ERNs and countries. To ensure best practices and standardisation of the process, HPO, OMIM and Orphanet (ORDO) ontologies are used to collect phenotypic data, and GATK best practices and GA4GH standards are followed in the collection and processing of genomic data through a standardised pipeline (Laurie et al., 2016 here).
As data submitted to Solve-RD for reanalysis has been sequenced at a variety of different centres, under different protocols and using different technologies it is fundamental to ensure a minimum quality of geno-pheno datasets to guarantee proper downstream analyses and results. Therefore, several quality control metrics have been established, and are now automatically performed for each of the samples entering the Solve-RD project.
Here we report on the established framework for quality assessment (processing checkpoints, genome coverage metrics, phenotypic data and sample relatedness) for RD-Connect and Solve-RD data and provide guidelines to enable Solve-RD data submitters and Data Analysis Task Force (DATF) members to easily assess the quality of the provided data and compare genomic datasets before undertaking further downstream analyses.
D2.5 Report on new matchmaking strategies
Matchmaking technologies cover a very specific data findability need by enabling the discovery of individuals with similar phenotypes and/or potential causative genetic variants, among others. Solve-RD has contributed to the development and implementation of several matchmaking strategies which are now available to the consortium but can also be deployed in other settings. The RD-Connect Genome Phenome Analysis Platform (GPAP) is connected to the MatchMaker Exchange network (MME), which enables users, including all Solve-RD partners, to query for individuals with similar symptoms and/or candidate disease genes within the GPAP (internal matchmaking), and against four external databases: PhenomeCentral, DECIPHER, GeneMatcher, and MyGene2. The PhenoStore module allows the users to find individuals with specific categories within the RD-Connect GPAP. The CohortApp module enables the users to create in-silico cohorts according to certain search criteria; such cohorts can then be used to launch genetic queries within the RD-Connect GPAP analysis module. Finally, powerful search, discovery and matchmaking capabilities are provided through RD3/Sandbox and Discovery Nexus.
D3.5 Treatabolome database
The Treatabolome: flagging treatable genes and variants. The database will be connected to the RD-Connect GPAP platform and made accessible as part of the real-time analysis of patients undergoing sequencing or exome analysis within Solve-RD as a proof of concept for the utility of the approach.
D3.8 First summer school for ePAGs delivered
This deliverable report covers the first edition of the EURORDIS Winter School, the capacity building training programme for rare disease patient representatives on scientific innovation and translational research, which was held at the Imagine Institute for Genetic Diseases in Paris from 19-23 March 2018.
D3.9 Second training for ePAGs delivered
This deliverable report covers the second edition of the EURORDIS Winter School, the capacity building training programme for rare disease patient representatives on scientific innovation and translational research, which was held at the Imagine Institute for Genetic Diseases in Paris, France from 11-15 March 2019.
D4.3 Central RD-Connect database serving Solve-RD, including user authentication and authorization
Solve-RD will employ the RD-Connect Genome-phenome Analysis Platform (GPAP) to pool and enable controlled access to a large number of harmonised and integrated datasets from unsolved rare disease cases. This deliverable report describes the actions taken to ensure GPAP is serving Solve-RD, including steps for user authentication and authorization and how researchers can share their data.
D4.5 Metadata catalog operational, with initial content
In order for the resources that are contributed to and will be developed during the Solve-RD project a discovery system was required. Such a system is being developed based on proven technologies and a suitable set of standards. This deliverable focuses on the building of an initial version of the system, based on the Café Variome platform, for asset discovery called RD-NEXUS (Rare Disease Network for EXploring the UNseen). In order to build the system, a working data model for an agreed set of parameters (termed 'findable facets’) was defined and integrated in to a data model. APIs allowing interoperability with other systems were also developed in collaboration with the GA4GH. Exemplar data have been processed and entered into the current RD-NEXUS system to illustrate its functionality and highlight any potential issues, before being demonstrated to potential users within the ERN networks. A complete first version of RD-NEXUS was thereby created, and is now available for demonstration and testing.
D4.7 All foundational standards selected and implemented across the project
Underpinning all activities at Solve-RD, from data submission, quality control, data dissemination to appropriate resources, discovery and finally distribution, a set of defined standards are required to ensure smooth and efficient data flow and interoperability between resources within Solve-RD and external resources. This deliverable focuses on defining and implementing the set of standards required to facilitate these processes, and here we describe how these standards have been established and implemented across Solve-RD.
D5.1 Bespoke Phenotips frontends for associated ERNsand undiagnosed disease programmes
This report provides information on the new clinical submission forms designed and implemented in GPAP-PhenoStore, the phenotypic module of the RD-Connect Genome-Phenome Analysis Platform (GPAP). These templates have been designed in collaboration with clinical experts from Solve-RD WP1 and in alignment with Genomics England data models. This implementation facilitates the collation and future portability of structured clinical information of unsolved patients from associated ERNs and undiagnosed disease programs.
D6.4 Solve-RD communication and dissemination tools
This deliverable report includes the Solve-RD communication and dissemination plans and describes the tools Solve-RD uses to implement both.