Next-generation phenotyping integrated in a national framework for patients with ultrarare disorders improves genetic diagnostics and yields new molecular findings

Schmidt, Axel; Danyel, Magdalena; Grundmann, Kathrin; Brunet, Theresa; Klinkhammer, Hannah; Hsieh, Tzung-Chien; Engels, Hartmut; Peters, Sophia; Knaus, Alexej; Moosa, Shahida; Averdunk, Luisa; Boschann, Felix; Sczakiel, Henrike Lisa; Schwartzmann, Sarina; Mensah, Martin Atta; Pantel, Jean Tori; Holtgrewe, Manuel; Bösch, Annemarie; Weiß, Claudia; Weinhold, Natalie; Suter, Aude-Annick; Stoltenburg, Corinna; Neugebauer, Julia; Kallinich, Tillmann; Kaindl, Angela M.; Holzhauer, Susanne; Bührer, Christoph; Bufler, Philip; Kornak, Uwe; Ott, Claus-Eric; Schülke, Markus; Nguyen, Hoa Huu Phuc; Hoffjan, Sabine; Grasemann, Corinna; Rothoeft, Tobias; Brinkmann, Folke; Matar, Nora; Sivalingam, Sugirthan; Perne, Claudia; Mangold, Elisabeth; Kreiss, Martina; Cremer, Kirsten; Betz, Regina C.; Mücke, Martin; Grigull, Lorenz; Klockgether, Thomas; Spier, Isabel; Heimbach, André; Bender, Tim; Brand, Fabian; Stieber, Christiane; Morawiec, Alexandra Marzena; Karakostas, Pantelis; Schäfer, Valentin S.; Bernsen, Sarah; Weydt, Patrick; Castro-Gomez, Sergio; Aziz, Ahmad; Grobe-Einsler, Marcus; Kimmich, Okka; Kobeleva, Xenia; Önder, Demet; Lesmann, Hellen; Kumar, Sheetal; Tacik, Pawel; Basin, Meghna Ahuja; Incardona, Pietro; Lee-Kirsch, Min Ae; Berner, Reinhard; Schuetz, Catharina; Körholz, Julia; Kretschmer, Tanita; Di Donato, Nataliya; Schröck, Evelin; Heinen, André; Reuner, Ulrike; Hanßke, Amalia-Mihaela; Kaiser, Frank J.; Manka, Eva; Munteanu, Martin; Kuechler, Alma; Cordula, Kiewert; Hirtz, Raphael; Schlapakow, Elena; Schlein, Christian; Lisfeld, Jasmin; Kubisch, Christian; Herget, Theresia; Hempel, Maja; Weiler-Normann, Christina; Ullrich, Kurt; Schramm, Christoph; Rudolph, Cornelia; Rillig, Franziska; Groffmann, Maximilian; Muntau, Ania; Tibelius, Alexandra; Schwaibold, Eva M. C.; Schaaf, Christian P.; Zawada, Michal; Kaufmann, Lilian; Hinderhofer, Katrin; Okun, Pamela M.; Kotzaeridou, Urania; Hoffmann, Georg F.; Choukair, Daniela; Bettendorf, Markus; Spielmann, Malte; Ripke, Annekatrin; Pauly, Martje; Münchau, Alexander; Lohmann, Katja; Hüning, Irina; Hanker, Britta; Bäumer, Tobias; Herzog, Rebecca; Hellenbroich, Yorck; Westphal, Dominik S.; Strom, Tim; Kovacs, Reka; Riedhammer, Korbinian M.; Mayerhanser, Katharina; Graf, Elisabeth; Brugger, Melanie; Hoefele, Julia; Oexle, Konrad; Mirza-Schreiber, Nazanin; Berutti, Riccardo; Schatz, Ulrich; Krenn, Martin; Makowski, Christine; Weigand, Heike; Schröder, Sebastian; Rohlfs, Meino; Vill, Katharina; Hauck, Fabian; Borggraefe, Ingo; Müller-Felber, Wolfgang; Kurth, Ingo; Elbracht, Miriam; Knopp, Cordula; Begemann, Matthias; Kraft, Florian; Lemke, Johannes R.; Hentschel, Julia; Platzer, Konrad; Strehlow, Vincent; Abou Jamra, Rami; Kehrer, Martin; Demidov, German; Beck-Wödl, Stefanie; Graessner, Holm; Sturm, Marc; Zeltner, Lena; Schöls, Ludger J.; Magg, Janine; Bevot, Andrea; Kehrer, Christiane; Kaiser, Nadja; Turro, Ernest; Horn, Denise; Grüters-Kieslich, Annette; Klein, Christoph; Mundlos, Stefan; Nöthen, Markus; Riess, Olaf; Meitinger, Thomas; Krude, Heiko; Krawitz, Peter M.; Haack, Tobias; Ehmke, Nadja; Wagner, Matias

doi:10.1038/s41588-024-01836-1

Download PDF

Article
Open access
Published: 22 July 2024

Next-generation phenotyping integrated in a national framework for patients with ultrarare disorders improves genetic diagnostics and yields new molecular findings

Axel Schmidt ORCID: orcid.org/0000-0002-9780-7243¹^na1,
Magdalena Danyel^2,3^na1,
Kathrin Grundmann⁴^na1,
Theresa Brunet ORCID: orcid.org/0000-0002-5183-780X⁵^na1,
Hannah Klinkhammer ORCID: orcid.org/0000-0003-3752-1275^6,7,
Tzung-Chien Hsieh ORCID: orcid.org/0000-0003-3828-4419⁶,
Hartmut Engels ORCID: orcid.org/0000-0003-1007-1809¹,
Sophia Peters¹,
Alexej Knaus ORCID: orcid.org/0000-0003-0366-0533⁶,
Shahida Moosa ORCID: orcid.org/0000-0002-4463-3067⁸,
Luisa Averdunk⁹,
Felix Boschann^2,3,
Henrike Lisa Sczakiel^2,3,
Sarina Schwartzmann²,
Martin Atta Mensah^2,3,
Jean Tori Pantel ORCID: orcid.org/0000-0002-2674-4660^2,10,
Manuel Holtgrewe¹¹,
Annemarie Bösch¹²,
Claudia Weiß ORCID: orcid.org/0000-0001-6167-5878¹²,
Natalie Weinhold¹²,
Aude-Annick Suter¹²,
Corinna Stoltenburg¹²,
Julia Neugebauer ORCID: orcid.org/0000-0002-2576-1012¹²,
Tillmann Kallinich¹²,
Angela M. Kaindl^13,14,15,
Susanne Holzhauer¹²,
Christoph Bührer¹²,
Philip Bufler¹²,
Uwe Kornak ORCID: orcid.org/0000-0002-4582-9838²,
Claus-Eric Ott ORCID: orcid.org/0000-0003-3627-3791²,
Markus Schülke ORCID: orcid.org/0000-0003-2824-3891²,
Hoa Huu Phuc Nguyen¹⁶,
Sabine Hoffjan¹⁶,
Corinna Grasemann¹⁷,
Tobias Rothoeft¹⁷,
Folke Brinkmann¹⁷,
Nora Matar¹⁷,
Sugirthan Sivalingam ORCID: orcid.org/0000-0001-5239-5137¹,
Claudia Perne¹,
Elisabeth Mangold¹,
Martina Kreiss¹,
Kirsten Cremer¹,
Regina C. Betz ORCID: orcid.org/0000-0001-5024-3623¹,
Martin Mücke¹⁸,
Lorenz Grigull¹⁸,
Thomas Klockgether¹⁹,
Isabel Spier¹,
André Heimbach¹,
Tim Bender¹⁸,
Fabian Brand⁶,
Christiane Stieber¹⁸,
Alexandra Marzena Morawiec¹⁸,
Pantelis Karakostas²⁰,
Valentin S. Schäfer ORCID: orcid.org/0000-0002-6591-5936²⁰,
Sarah Bernsen ORCID: orcid.org/0000-0001-7485-2626¹⁸,
Patrick Weydt¹⁹,
Sergio Castro-Gomez ORCID: orcid.org/0000-0002-1581-474X¹⁹,
Ahmad Aziz ORCID: orcid.org/0000-0001-6184-458X¹⁹,
Marcus Grobe-Einsler¹⁹,
Okka Kimmich¹⁹,
Xenia Kobeleva¹⁹,
Demet Önder¹⁹,
Hellen Lesmann¹,
Sheetal Kumar¹,
Pawel Tacik¹⁹,
Meghna Ahuja Basin⁶,
Pietro Incardona⁶,
Min Ae Lee-Kirsch^21,22,
Reinhard Berner^21,22,
Catharina Schuetz^21,22,
Julia Körholz^21,22,
Tanita Kretschmer^21,22,
Nataliya Di Donato ORCID: orcid.org/0000-0001-9439-4677^21,23,
Evelin Schröck ORCID: orcid.org/0000-0002-3377-1704^21,23,
André Heinen^21,22,
Ulrike Reuner^21,24,
Amalia-Mihaela Hanßke²¹,
Frank J. Kaiser²⁵,
Eva Manka²⁶,
Martin Munteanu²⁵,
Alma Kuechler²⁵,
Kiewert Cordula²⁶,
Raphael Hirtz²⁶,
Elena Schlapakow²⁷,
Christian Schlein ORCID: orcid.org/0000-0002-2791-1790²⁸,
Jasmin Lisfeld²⁸,
Christian Kubisch^28,29,
Theresia Herget²⁸,
Maja Hempel^28,29,30,
Christina Weiler-Normann^29,31,
Kurt Ullrich²⁹,
Christoph Schramm^29,31,
Cornelia Rudolph²⁹,
Franziska Rillig²⁹,
Maximilian Groffmann²⁹,
Ania Muntau³²,
Alexandra Tibelius³⁰,
Eva M. C. Schwaibold ORCID: orcid.org/0000-0003-2708-9642³⁰,
Christian P. Schaaf³⁰,
Michal Zawada³⁰,
Lilian Kaufmann³⁰,
Katrin Hinderhofer³⁰,
Pamela M. Okun ORCID: orcid.org/0000-0001-9981-8918³³,
Urania Kotzaeridou³³,
Georg F. Hoffmann³³,
Daniela Choukair³³,
Markus Bettendorf³³,
Malte Spielmann ORCID: orcid.org/0000-0002-0583-4683³⁴,
Annekatrin Ripke³⁵,
Martje Pauly ORCID: orcid.org/0000-0002-7794-0282^36,37,
Alexander Münchau^35,38,
Katja Lohmann ORCID: orcid.org/0000-0002-5121-1460³⁹,
Irina Hüning³⁴,
Britta Hanker⁴⁰,
Tobias Bäumer^35,38,
Rebecca Herzog ORCID: orcid.org/0000-0002-5906-5947^35,36,
Yorck Hellenbroich⁴¹,
Dominik S. Westphal ORCID: orcid.org/0000-0003-4870-9863⁵,
Tim Strom⁵,
Reka Kovacs⁵,
Korbinian M. Riedhammer ORCID: orcid.org/0000-0002-7503-5801^5,42,
Katharina Mayerhanser⁵,
Elisabeth Graf⁵,
Melanie Brugger ORCID: orcid.org/0000-0002-6920-8550⁵,
Julia Hoefele⁵,
Konrad Oexle⁴³,
Nazanin Mirza-Schreiber ORCID: orcid.org/0000-0003-0836-8267⁴³,
Riccardo Berutti ORCID: orcid.org/0000-0003-1862-3700⁴³,
Ulrich Schatz⁵,
Martin Krenn^5,44,
Christine Makowski⁴⁵,
Heike Weigand⁴⁶,
Sebastian Schröder⁴⁶,
Meino Rohlfs⁴⁶,
Katharina Vill⁴⁶,
Fabian Hauck ORCID: orcid.org/0000-0001-9644-2003⁴⁶,
Ingo Borggraefe ORCID: orcid.org/0000-0002-8484-5945⁴⁶,
Wolfgang Müller-Felber⁴⁶,
Ingo Kurth ORCID: orcid.org/0000-0002-5642-8378¹⁰,
Miriam Elbracht ORCID: orcid.org/0000-0001-5088-1369¹⁰,
Cordula Knopp¹⁰,
Matthias Begemann ORCID: orcid.org/0000-0002-4659-8437¹⁰,
Florian Kraft ORCID: orcid.org/0000-0002-5324-9155¹⁰,
Johannes R. Lemke ORCID: orcid.org/0000-0002-4435-6610^47,48,
Julia Hentschel⁴⁷,
Konrad Platzer ORCID: orcid.org/0000-0001-6127-6308⁴⁷,
Vincent Strehlow⁴⁷,
Rami Abou Jamra ORCID: orcid.org/0000-0002-1542-1399⁴⁷,
Martin Kehrer⁴,
German Demidov ORCID: orcid.org/0000-0001-9075-4276⁴,
Stefanie Beck-Wödl⁴,
Holm Graessner⁴⁹,
Marc Sturm ORCID: orcid.org/0000-0002-6552-8362⁴,
Lena Zeltner⁴⁹,
Ludger J. Schöls⁵⁰,
Janine Magg⁴⁹,
Andrea Bevot⁵¹,
Christiane Kehrer⁵¹,
Nadja Kaiser⁵¹,
Ernest Turro ORCID: orcid.org/0000-0002-1820-6563⁵²,
Denise Horn²,
Annette Grüters-Kieslich⁵³,
Christoph Klein ORCID: orcid.org/0000-0003-0956-0445⁴⁶,
Stefan Mundlos²,
Markus Nöthen ORCID: orcid.org/0000-0002-8770-2464¹,
Olaf Riess ORCID: orcid.org/0000-0002-7011-1369⁴,
Thomas Meitinger ORCID: orcid.org/0000-0002-8838-8403⁵,
Heiko Krude⁵³,
Peter M. Krawitz ORCID: orcid.org/0000-0002-3194-8625⁶^na2,
Tobias Haack ORCID: orcid.org/0000-0001-6033-4836⁴^na2,
Nadja Ehmke^2,3^na2 &
…
Matias Wagner^5,43,46^na2

Nature Genetics volume 56, pages 1644–1653 (2024)Cite this article

17k Accesses
1 Citations
243 Altmetric
Metrics details

Subjects

Abstract

Individuals with ultrarare disorders pose a structural challenge for healthcare systems since expert clinical knowledge is required to establish diagnoses. In TRANSLATE NAMSE, a 3-year prospective study, we evaluated a novel diagnostic concept based on multidisciplinary expertise in Germany. Here we present the systematic investigation of the phenotypic and molecular genetic data of 1,577 patients who had undergone exome sequencing and were partially analyzed with next-generation phenotyping approaches. Molecular genetic diagnoses were established in 32% of the patients totaling 370 distinct molecular genetic causes, most with prevalence below 1:50,000. During the diagnostic process, 34 novel and 23 candidate genotype–phenotype associations were identified, mainly in individuals with neurodevelopmental disorders. Sequencing data of the subcohort that consented to computer-assisted analysis of their facial images with GestaltMatcher could be prioritized more efficiently compared with approaches based solely on clinical features and molecular scores. Our study demonstrates the synergy of using next-generation sequencing and phenotyping for diagnosing ultrarare diseases in routine healthcare and discovering novel etiologies by multidisciplinary teams.

PhenoScore quantifies phenotypic variation for rare genetic diseases by combining facial analysis with other clinical features using a machine-learning framework

Article 07 August 2023

Consensus reporting guidelines to address gaps in descriptions of ultra-rare genetic conditions

Article Open access 06 April 2024

Combining exome/genome sequencing with data repository analysis reveals novel gene–disease associations for a wide range of genetic disorders

Article Open access 19 April 2021

Main

A recent analysis of the Orphanet database showed that around 3–6% of the global population have a rare disease (that is, a disease with a prevalence of <1 in 2,000) and that 72% of such cases may have a genetic cause¹. Rare diseases thus represent a substantial global health burden. However, only a minority of patients suspected to have a rare disease receive both a definite clinical diagnosis and a confirmatory molecular test result^2,3. This concerns in particular the subset of patients with ultrarare disorders that are defined in the European Union as affecting no more than one person in 50,000 and that follow a long tail distribution with respect to their frequency (Regulation (EU) No. 536/2014). It is estimated that roughly 80% of the more than 5,000 rare genetic diseases have a prevalence below one in a million¹.

The International Rare Disease Research Consortium therefore stated that, by 2027, all patients who come to medical attention with a suspected rare or ultrarare disease should be diagnosed within 1 year if the respective disorder has been described in the medical literature⁴. Since many rare diseases are Mendelian in nature, comprehensive genetic testing is a key element to achieve that goal.

In Germany, around 90% of the population has statutory health insurance, and the current reimbursement scheme allows physicians to request chromosome analyses, molecular karyotyping and sequencing of single genes or gene panels. For example, high-resolution genome-wide array-based segmental aneusomy profiling detects a pathogenic aberration in around 19% of patients with developmental delay⁵. Besides contiguous gene syndromes, most of the remaining rare disorders are monogenic and are caused by single nucleotide variants or small insertions or deletions (indels). However, single gene analyses or small gene panels are only likely to detect a pathogenic aberration if the phenotype is highly predictive of the molecular cause, for example, hemoglobinopathies⁶.

For phenotypes with high genetic heterogeneity, such as neurodevelopmental disorders, genetic investigation is more challenging. For intellectual disability, for example, studies so far have identified disease associations for more than a thousand genes⁷. For these disorders, research has shown that exome sequencing can be more cost-effective than sequencing potentially multiple gene panels⁸. However, this is also accompanied by more genetic variants that have to be assessed. Therefore, a clear indication for exome sequencing and efficient data analysis strategies are crucial. Between 2018 and 2020, a novel diagnostic concept within the German healthcare system was evaluated in the prospective study TRANSLATE NAMSE⁹.

This involved standardized structures and procedures and multidisciplinary teams (MDTs) at ten university hospital-based centers for rare diseases (CRDs). The MDTs conducted a three-step diagnostic process: (1) primary review of patient records; (2) selection of diagnostic procedures, including a possible recommendation for exome sequencing; and (3) evaluation of all findings, including genetic variants. A key goal was to investigate whether exome sequencing would facilitate the diagnosis of ultrarare disorders or even the delineation of novel monogenic disorders. In this work, we report the molecular findings of this study.

Furthermore, we investigated how phenotypic features can be used to estimate the probability that a molecular diagnosis can be established with exome sequencing (YieldPred). In a companion study, we also assessed the extent to which the results from computer-assisted pattern recognition in facial dysmorphism contribute to variant interpretation (prioritization of exome data by image analysis, PEDIA). The present analyses demonstrated that exome sequencing facilitated the diagnosis of ultrarare genetic diseases and novel gene–disease associations and that artificial intelligence (AI)-driven technologies improved the diagnostic yield for ultrarare genetic disorders.

Results

Phenotypic characteristics of the study cohort

Between 2018 and 2020, a total of 5,652 individuals (2,033 adults and 3,619 children) with a suspected rare disorder were enrolled in TRANSLATE NAMSE by CRDs at ten German university hospitals (Fig. 1a)⁹. The present analyses were performed using the data from a total of 1,577 of these 5,652 patients (268 adults, 1,309 children). In these individuals, the MDT at the respective CRD considered a genetic cause as plausible and exome sequencing as the most suitable test (exome sequencing cohort, Supplementary Table 1). Each of these 1,577 individuals was assigned to one of six major disease categories by the respective CRD physician (Fig. 1b). The majority of children were assigned to the disease category ‘neurodevelopmental disorders’ (n = 702, 54%), and the largest proportion of adults were assigned to the disease category ‘neurological or neuromuscular disorders’ (n = 117, 44%). Smaller proportions of adult and pediatric cases were assigned to the groups ‘organ malformation’, ‘endocrine/metabolic disorders’, ‘immune/hematologic disorders’ and ‘cardiovascular disorders’. Patient phenotypes were also annotated with terms of the Human Phenotype Ontology (HPO) by the respective CRD physicians. On average, five HPO terms were specified per individual (Supplementary Fig. 1a). The phenotypes within the present cohort were visualized by projecting the patient-specific HPO terms into a two-dimensional space. While most patients from the same disease group were in close proximity, the clusters showed a partial overlap (Fig. 1c). For example, many patients categorized within ‘neurological or neuromuscular disorders’ also showed HPO terms typically associated with ‘neurodevelopmental disorders’ and vice versa (Supplementary Fig. 1b). This suggests that grouping patients into single disease groups may be overly simplistic.

**Fig. 1: Workflow in the TRANSLATE NAMSE project and phenotypes in which exome sequencing was performed.**

Diagnostic yield of exome sequencing

A molecular diagnosis was established in a total of 499 of the 1,577 patients (32%), that is, in these cases, exome sequencing identified variants that fully or partially explained the phenotype. The diagnostic yield was slightly higher in children (32%) than in adults (28%, P = 0.13, Fisher’s exact test; Fig. 2a) and twofold higher in patients assigned to the category ‘neurodevelopmental disorder’ than for all other disease categories (42% versus 22%, P < 0.001, Fisher’s exact test with Bonferroni correction; for single comparisons between disorder groups, see Fig. 2b). Furthermore, exome sequencing found variants of uncertain significance. Specifically, these variants were enriched for missense variants (80% versus 45%, P < 0.001; Supplementary Fig. 2), due to lower support for pathogenicity according to the guidelines of the American College of Medical Genetics (ACMG) and the Association for Molecular Pathologists for interpretation of sequence variants.

De novo variants and parental mosaicism

A total of 228 diagnoses (45% of 510 diagnoses including dual diagnoses) were attributable to de novo variants, making them the most common cause of disease in families with an autozygosity below 0.02 and the second most common cause in families with consanguinity (Fig. 3). In three families with variants that were initially classified as de novo, evidence for probable or certain parental mosaicism was found (Supplementary Note). In one of these families, the same likely pathogenic variant in PUF60 was identified as the cause of developmental delay in two affected brothers. Since the variant was not detectable in the exome data of either parent, gonadal mosaicism could not be confirmed and was instead presumed on the basis of the family history. The detection in the exome sequencing analysis of three probable parental mosaics among 228 patients corresponds to a frequency of 1.3%, which is within the estimated interval of clinically relevant parental mosaicism^10,11,12.

**Fig. 3: Mode of inheritance and disease burden are dependent on autozygosity.**

Recessive disease burden

The second-largest proportion of solved cases involved an autosomal recessive (AR) mode of inheritance (125 solved cases, 14.5% of all diagnoses; Fig. 3a). In total, 94 of the causative variants in the 125 recessive diagnoses in the present cohort would also have been classified as pathogenic if identified in healthy individuals¹³. The diagnostic yield was considerably higher in patients with presumed consanguinity (low autozygosity 31%, n = 1,014 versus high autozygosity 41%, n = 144, P = 0.01, Fisher’s exact test), and the composition of the modes of inheritance also differed significantly between the high- and low-autozygosity groups (Fig. 3b). The relative contribution of homozygous variants was significantly higher in the high-autozygosity group (73% of n = 62 diagnoses) than in the low-autozygosity group (2% of n = 313 diagnoses) (odds ratio (OR) 111.5, P < 0.001, Fisher’s exact test). In contrast, the contribution to disease of de novo variants was 13% (n = 62 diagnoses) in the high-autozygosity group compared with 54% (n = 313 diagnoses) in the low-autozygosity group (OR 0.2, P < 0.001, Fisher’s exact test). Since the de novo mutation count is dependent on parental age but not on autozygosity, the disease prevalence that is attributable to de novo variants should be comparable between both groups and can be used for normalization (Fig. 3c). For an inbreeding coefficient of >2%, this suggests a recessive disease burden that is sevenfold higher than for those with lower inbreeding coefficients, which is consistent with previous reports^14,15,16. However, it also has to be acknowledged that population expansion results in a drop in the prevalence of recessive disorders in random mating populations and that the lower recessive disease burden might be only a transient effect¹⁷.

Dual molecular diagnoses and secondary findings

For 11 individuals, who represented approximately 2% of all solved cases, molecular diagnoses for two distinct or overlapping disease phenotypes were established (Supplementary Table 2). This group showed a tendency for high autozygosity (43%, n = 7 versus 16%, n = 361, P = 0.09, Fisher’s exact test) and recessive disorders (41%, n = 22 diagnoses versus 24%, n = 488 diagnoses, P = 0.08, Fisher’s exact test). The detected percentage of dual diagnoses (2%, 11 of 499 solved cases) is consistent with both the enrichment of high autozygosity and recessive disorders in this group, and earlier reports^18,19.

In 17 individuals who had consented to being informed about secondary findings, we identified medically actionable variants that were unrelated to the present phenotype. The list of 59 actionable genes was based on the ACMG recommendations; however, secondary findings in 7 additional genes were reported following discussions within the respective MDTs (Supplementary Note).

Enrichment of ultrarare diagnoses

For the 499 individuals in whom exome sequencing led to a molecular diagnosis, a total of 549 disease-causing variants were identified in 362 different disease-associated genes as well as structural variants affecting 14 genomic regions (Supplementary Table 1). This plethora of diagnoses suggests that each specific genetic disorder had a very low prevalence. To clarify this, the results were compared with the total number of (likely) pathogenic ClinVar submissions for the respective genes (Fig. 4a). The first quartile of ClinVar variants corresponds to the more frequently identified rare diseases and contains 40,078 variants assigned to 47 genes. In the group of 499 individuals with a molecular diagnosis in the present cohort, only 33 patients and 14 different disease-associated genes fell into this first quartile. In contrast, the majority of the present 499 patients (corresponding to 192 different disorders) were assigned to the fourth quartile, which contains disease genes with the least ClinVar submissions (Fig. 4b). Notably, almost half of the diagnoses assigned in the present cohort were only established in the past decade (Fig. 4c). A comparison with a cohort of comparable size²⁰ revealed a significantly different distribution with respect to the years in which the phenotype was first associated with the respective disease-causing gene (Kolmogorov–Smirnoff test, P < 0.001; Supplementary Figs. 3 and 4).

**Fig. 4: Most variants identified in TRANSLATE NAMSE exome sequencing cohort cause ultrarare disorders that were first associated with a gene in the last decade.**

Novel DGGs and candidates

In cases for which no molecular diagnosis could be established due to variants in the known clinical exome, all potentially deleterious variants in the remaining exome were assessed for plausible novel disease etiologies (see detailed scoring for 57 candidate genes in 65 cases in Methods, Supplementary Note and Supplementary Table 3). Moderate evidence was generated for 23 of 57 candidate genes, and high evidence was generated for the remaining 34. A total of 17 candidate genes with high evidence are currently undergoing further investigation, mostly within the framework of international projects. A total of 17 genes (12 with autosomal dominant inheritance, 5 with autosomal recessive inheritance) have acquired diagnostic-grade gene (DGG) status during the first three years through international cooperation^{21,22,23,24,25,26,27,28,29,30,31,32,33}. After the end of the study, two more candidate genes transitioned to the group of DGGs due to additional phenotypic, functional and statistical evidence became available^32,34.

In comparison with pathogenic variants in previously known disease-associated genes, the present candidate gene set showed a higher proportion of missense variants. This is probably attributable to the fact that the classification of missense variants is more challenging (Supplementary Table 3).

Functional assays

For 18 cases that were classified as uncertain or unsolved after initial exome sequencing, multi-omic assays were performed, that is, an analysis of the methylome (n = 4), proteome (n = 3) or transcriptome (n = 14). Epigenetic signatures, as derived from methylome analyses, clarified the status of de novo missense variants as likely benign in one case and as pathogenic in three. This is exemplified by a case with a missense variant in KMT2D (Supplementary Note)^35,36. Variants in MDH2 were reclassified to pathogenic, on the basis of a proteome analysis of patient-derived fibroblasts (Supplementary Note), while results were inconclusive in two unsolved cases. In 13 unsolved cases, RNA sequencing was performed but could not identify transcriptome alterations that lead to the identification of causative variants. Thus, in 5/18 cases, complementary assays facilitated variant reclassification and highlighted the importance of variant validation strategies in diagnostics for suspected rare genetic diseases (Supplementary Note)^37,38,39.

Predicting the diagnostic yield using machine learning

Analyses were then conducted to investigate whether the phenotype predicted the diagnostic yield of exome sequencing. For this purpose, a least absolute shrinkage and selection operator (LASSO) analysis for binary outcomes was performed. To reduce the phenotypic dimension and to increase interpretability, HPO terms were first aggregated into 49 nonoverlapping phenotypic groups. These phenotypic groups were used as predictors in the LASSO analysis. The resulting model was able to discriminate between solved and unsolved cases (Supplementary Fig. 5a; area under the curve (AUC) 0.67, 95% confidence interval (CI) 0.61–0.74, on a held-out test set of the exome sequencing cohort, n = 321) and yielded the HPO groups ‘dysfunction of higher cognitive abilities’, ‘hematological abnormalities’ and ‘ataxia’ as very influential predictors in terms of the establishment of a molecular diagnosis via exome sequencing (Fig. 5a). To improve the predictions for a wider variety of phenotypic features, we trained on samples of additional cohorts and made the model available as a web service (https://translate-namse.de). YieldPred can now be used to estimate the diagnostic yield of exome sequencing on the basis of the phenotypic features of a given patient and might therefore help in expectation management (Methods and Supplementary Figs. 3, 5 and 6).

**Fig. 5: Machine learning identifies features relevant to the diagnostic yield and can support variant prioritization.**

Variant prioritization using facial image analysis (PEDIA)

A total of 224 of the 1,577 patients had also provided written informed consent for the evaluation of their facial images with the AI tool GestaltMatcher⁴⁰ and the use of the results (gestalt scores) in exome variant interpretation (PEDIA)⁴¹. In 94 of these PEDIA subcohort cases, a molecular diagnosis was established. For 81 of these 94 cases, the gestalt scores improved prioritization results, that is, the correct diagnosis was ranked higher. In general, the PEDIA approach (that is, a combined scoring approach involving genotype-, phenotype- and facial gestalt-based prioritization tools) can contribute to prioritization efficiency, provided that (1) the clinical features of the underlying disorder include facial dysmorphism and (2) molecularly solved cases are already part of the GestaltMatcher Database⁴⁰ (https://db.gestaltmatcher.org/). In the present PEDIA subcohort, for 81 cases, representing 68 different disorders, one or more previously solved cases were phenotypically so similar that the gestalt score for the associated disease gene resulted in a higher ranking for the pathogenic variant than prioritization approaches that do not make use of image analysis.

Four different variant prioritization approaches involving genotype-based and/or phenotype-based scores were analyzed and their respective accuracy rates compared. For the PEDIA approach, the correct disease-associated gene was listed among the top ten suggestions in 82% of the cases. The PEDIA approach outperformed prioritization by either a molecular score (combined annotation-dependent depletion, CADD⁴²) or GestaltMatcher only, as well as the combined molecular and feature score (CADD + case annotation and disorder annotation (CADA)) (Fig. 5b). As the latter can be considered routine in exome sequencing analysis, additional gestalt scores help to improve variant interpretation in diagnostics.

Based on these results and the extension of the TRANSLATE NAMSE study beyond the initial 3 years, the PEDIA workflow was implemented at further sites. The exome sequencing data of another 149 patients were then analyzed. In this additional cohort, a molecular diagnosis was established in 69 patients, and a top-10 accuracy of 83% was achieved using the PEDIA score (Supplementary Fig. 7).

The PEDIA approach is highly modular, and the GestaltMatcher score for image analysis can also be combined with other prioritization tools such as Exomiser⁴³, Xrare⁴⁴, LIRICAL⁴⁵ or Amelie⁴⁶, which use different molecular scores or HPO-based scores. All tested combinations showed improvements in the top-k accuracies and are discussed in Supplementary Note and Supplementary Fig. 8.

In some cases, the gestalt scores were particularly suggestive and facilitated the identification of otherwise challenging pathogenic variants. For instance, in a patient with a very high gestalt score for Koolen de Vries syndrome, a 4.7-kb de novo deletion affecting KANSL1 was detected⁴⁷. Other case reports of particular interest are described in Supplementary Note and Supplementary Fig. 9.

Exemplary diagnoses with targeted therapy

Implications of diagnoses on clinical management were not assessed in a structured way. However, for five patients in the TRANSLATE NAMSE cohort with a molecular diagnosis (1%), individualized treatments or therapies directed against the mechanism of the disease could be initiated⁴⁸. A patient with metachromatic leukodystrophy due to pathogenic variants in arylsulfatase alpha was treated with autologous CD34⁺ cells that were transduced ex vivo using a lentiviral vector encoding arylsulfatase alpha⁴⁹. The gene therapeutic approach with atidarsagene autotemcel has been authorized by European Medicines Agency (EMA) in the European Union since 17 December 2020. A patient with pyruvate dehydrogenase E1-α deficiency due to a de novo variant in PDHA1 and another patient with GLUT1-deficiency due to pathogenic variants in SLC2A1 were treated with a ketogenic diet. In a patient with cerebral creatine deficiency syndrome 1, due to a missense substitution in SLC6A8, supplementation with creatine was started. In a patient with congenital disorder of glycosylation of type IIc, due to a homozygous missense variant in SLC35C1, the fucosylation deficiency was treated by oral fucose supplementation⁵⁰.

Discussion

Reducing the time to diagnosis from several years to less than 1 year is highly relevant in terms of both prognosis and the targeted use of healthcare resources, since the number of approved therapies for rare diseases in which early treatment is associated with better outcomes is now increasing⁵¹. Establishing a molecular diagnosis quickly will require the implementation of frameworks within healthcare systems that are dedicated to patients with rare diseases. The novel diagnostic approach evaluated in TRANSLATE NAMSE was the practical realization of such a concept. The present investigation suggests that a combination of a structured clinical assessment by an MDT, an advanced sequencing test, such as exome sequencing, and a comprehensive discussion of the results reduces diagnostic delay and may improve therapy. These findings are consistent with reports from other healthcare systems and other disorders that benefit from interdisciplinary structures^{20,52,53,54,55,56}. On the basis of the present data, in 2021, exome sequencing was included in the list of standard medical services offered to patients with suspected rare diseases who were referred to German CRDs. For all the patients that are still awaiting a molecular diagnosis, new multi-omics approaches are promising but also costly. Therefore, in a complex healthcare system, these tests compete with other analyses, and their efficiency and efficacy in establishing a diagnosis should be evaluated in the future. However, it will be crucial within the German healthcare system that the inclusion of MDTs in the diagnostic process does not delay or even hinder genetic testing for patients with rare diseases. With exome sequencing being incorporated into an increasing number of guidelines, we also anticipate that the focus of the MDT will shift from test selection toward variant interpretation and identifying therapeutic options. By these means, MDTs operating in CRDs would fulfill a similar purpose for patients with rare disorders as molecular tumor boards in centers for personalized medicine already do for cancer patients⁵⁷.

Two notable findings of the present analyses were that, in comparison with ClinVar and a previously reported rare disease cohort of similar size²⁰, the TRANSLATE NAMSE cohort was significantly enriched for ultrarare disorders (Fig. 4a and Supplementary Fig. 4) and that a large number of recently described gene–disease associations were found^1,8,20,58. In our opinion, this accumulation of ultrarare diagnoses and the relative absence of more common conditions is explained by the study protocol, which required consideration of different test options, including gene panels. Furthermore, the fact that a large number of the established diagnoses have only become possible in recent years as a result of increasing medical genetic knowledge (Fig. 4c) highlights the importance of reanalysis of exome data^59,60. Indeed, the present analyses identified a large number of individuals who carried variants that indicated a novel disease–gene association (12% of solved cases), which highlights the fact that the analysis of exome sequencing data should not be limited to known disease genes. Establishing novel gene–disease associations and conducting functional analyses for the reclassification of variants of uncertain significance are time-consuming and highly complex endeavors⁶¹. Hence, from the present logistical perspective, such analyses are easier to perform in a research context than within the routine diagnostic context of clinical practice. However, these findings are of crucial importance for affected individuals and their families. Thus, from a teleological perspective, in some rare disease cases, boundaries separating diagnostics and research are somewhat blurred. Therefore, in the tertiary, academic setting, collaboration between experts from diagnostics and research is highly relevant for patients with suspected ultrarare diseases and a lack of definitive diagnostic findings.

In several patients from the present cohort, molecular diagnoses also resulted in a change of clinical management to a causal or even curative approach to therapy as described above. These cases emphasize the fact that molecular genetic diagnoses are essential in terms of the development of personalized treatments or therapies that are directed against the underlying disease mechanism. The systematic, consortium-based collection of molecular and clinical data represents the first necessary milestone toward achieving this goal. Particularly in the case of ultrarare disorders, the collection of these data requires additional international collaborative efforts.

Besides the ability to select the appropriate genetic test for diagnosing a disease, a core competence of a clinical geneticist is to estimate disease risk in the offspring of healthy individuals.¹⁷ In addition to the relatedness of the partners, the burden of heterozygous pathogenic variants in recessive genes, which can vary considerably depending on demographics^62,63,64,65, could play an increasingly important role in family planning. In a total of 94 of the 125 cases with recessive molecular diagnoses, the causal variants would also have been classified as (likely) pathogenic if they had been identified in healthy individuals¹³. This also means that, if the parents of pediatric patients with a recessive disorder in the present cohort had undergone exome sequencing to determine their carrier status, three out of four of these couples could have received appropriate genetic counseling concerning disease risk in future offspring, which supports the argument for extended screening⁶⁶.

Another aim of the present study was to determine whether complementary AI and machine learning approaches would facilitate diagnostic effectiveness and efficiency in the exome sequencing cohort. The PEDIA analyses showed that AI-powered next-generation phenotyping increased the efficiency of exome sequencing data analysis. However, not every case in the present cohort was solved via exome sequencing. Therefore, the machine learning model YieldPred was developed to identify features that had a major impact on the diagnostic yield in our and other study cohorts. Prospectively, this approach can also be used for two purposes. First, it can be used to estimate the probability that exome sequencing will result in a molecular diagnosis in each patient with a suspected rare disease and can by these means help to manage expectations. Second, as YieldPred in its current form provides an estimation of the diagnostic yield of exome sequencing and not of an underlying monogenic condition of a certain individual, it can be used to stratify individuals for more comprehensive genetic testing, that is, a low YieldPred score despite a high likelihood of a monogenic disease indicates that transcriptomics, proteomics or genome sequencing could be promising.

It would be desirable for all individuals with a suspected monogenic disorder for whom no definitive diagnosis can yet be established to have the option of participating in large-scale genomic diagnostic and research initiatives. We present TRANSLATE NAMSE as the German framework that organizes diagnostics for patients with ultrarare diseases with a backbone of case conferences in MDTs in academic CRDs. TRANSLATE NAMSE represents the first national-level project for undiagnosed patients in Germany, and the future expansion of the network on both the national and international level is planned.

In summary, the results of the present study demonstrate that our novel, structured diagnostic concept facilitates the identification of ultrarare disorders on a national level, provides undiagnosed patients with the opportunity to participate in international research, and represents a platform for data sharing that facilitates the development of machine learning and AI tools to improve the diagnostic yield.

Methods

Enrollment, research ethics and consent

A detailed description of the TRANSLATE NAMSE project is provided elsewhere^9,70. In brief, participants for TRANSLATE NAMSE were recruited between January 2018 and December 2020 from a total of ten German CRDs (Berlin, Bochum, Bonn, Dresden, Duisburg/Essen, Hamburg, Heidelberg, Kiel/Lübeck, München and Tübingen). Overall coordination of the recruitment process was performed by the Institute of Public Health Berlin. This study is governed by the approval of the following institutional review boards: Charité – Universitätsmedizin Berlin, Germany (EA2/140/17); UKB Universitätsklinikum Bonn, Germany (Lfd.Nr.386/17); Universitätsklinikum Essen, University Duisburg-Essen, Germany (17-7774-BO); Universitätsklinikum Heidelberg, Germany (S-499/2017); Universitätsklinikum Tübingen, Germany (643/2017BO1); Universität zu Lübeck, Germany (17-272); Ludwig-Maximilians-Universität München, Germany (17-640); Ärztekammer Hamburg, Germany (MC-316/17); Technische Universität Dresden, Germany (AK 464122017). All patients or their legal guardians provided written informed consent before inclusion. The inclusion criteria for TRANSLATE NAMSE were the lack of a definitive diagnosis and the clinical suspicion of a rare disease. The medical records and family history of each individual were evaluated by a MDT, which comprised at least board-certified physicians of two specialities with domain-specific expertise. For each individual, the respective MDT then made recommendations concerning diagnostics and further clinical management. To make the recommendation of exome sequencing, a board-certified human geneticist was additionally required within the MDT. For example, strong criteria for the indication of exome sequencing were congenital malformations, a syndromic phenotype, a positive family history suggestive of a monogenic disease and lack of absence of an alternative test with a comparable suspected diagnostic yield. A total of 1,577 patients (268 adult and 1,309 pediatric) from the TRANSLATE NAMSE cohort were referred for exome sequencing on the recommendation of the MDT at the respective CRD (exome sequencing cohort). The phenotypic and molecular genetic data of these 1,577 patients were evaluated in the present analyses.

Clinical and laboratory phenotype data

Clinical and laboratory phenotype data were transferred to the sequencing laboratory in the form of hard-copy case report forms or as online data capture applications (Face2Gene Clinic). Online data capture allowed the free entry of HPO terms. Data from hard-copy report forms and free-text entries were transformed into HPO terms. The phenotypes reported in the present study are those that were reported to the sequencing laboratories. On the basis of the leading presenting clinical feature, each case was assigned to one of six major disease groups (Supplementary Fig. 1b). This allowed a more definitive statement on diagnostic yield in relation to the clinical features of the patient. In the subsequent analyses, all assigned HPO terms (n = 1,649) were compiled and divided into higher-order groups (n = 12) and subcategories (n = 49) by expert clinicians. Therefore, patients were additionally assigned to at least one higher-order group as well as at least one subgroup. To assign a patient to an HPO-defined group, the patient had to have at least one of the HPO terms belonging to the respective group. The following higher-order groups were defined: 1, neurodevelopmental; 2, neuromuscular; 3, seizures; 4, growth disorders; 5, facial dysmorphism; 6, abnormality of connective tissue; 7, congenital malformations; 8, endocrine and metabolic abnormalities; 9, immune and hematological abnormalities; 10, sensory organ alterations; 11, abnormal findings on brain magnetic resonance imaging; 12, others. Within the respective higher-order groups, HPO terms were further assigned to subcategories (n = 49) (https://github.com/Ax-Sch/TNAMSE_geno_pheno/blob/main/resources/hpo_categorization_19_12_2022.tsv).

DNA sequencing

Details on DNA sequencing for each sequencing laboratory are given in Supplementary Table 4. Trio sequencing was conducted for 58% of the cases. When additional informative relatives were available, these were also included in the analysis as permitted by German law (healthy minors were not analyzed). EDTA-treated whole-blood samples or saliva kits were delivered to one of the five participating sequencing centers (Berlin, Bonn, LMU Munich, Munich or Tuebingen) for further processing. After DNA extraction, fragment size and purity were assessed. If the DNA fulfilled all quality criteria, the sample was submitted for sequencing. Exome sequencing was performed on exon targets that were isolated using capture and either Agilent SureSelect Human All Exon kits v6 or v7 (Agilent Technologies), or the Human Core Exome Kit (Twist Bioscience). One microgram of DNA was sheared into 350–400-bp fragments, which were then repaired, ligated to adaptors and purified for subsequent polymerase chain reaction amplification. Amplified products were then captured by biotinylated RNA library baits in solution, in accordance with the manufacturer’s instructions. Bound DNA was isolated with streptavidin-coated beads and reamplified. The final isolated products were sequenced using the Illumina NextSeq 500, NextSeq 550, HiSeq 2500 or NovaSeq 6000 sequencing system and 2 × 100-bp paired-end reads (Illumina). All five sequencing centers ensured a coverage of over 20× in over 95% of the RefSeq target region.

Exome sequencing data-processing pipeline

Details on exome sequencing data processing for each sequencing laboratory are given in Supplementary Table 4. At each of the five sequencing centers, exome sequencing processing pipelines were established according to best practice guidelines. The DNA sequence was mapped to the published human genome build GRCh37 reference sequence using Burrows–Wheeler Aligner (BWA). The most up-to-date version at the time of sequencing was used, progressing from BWA v0.7.11 through to BWA-Mem v0.7.17^71,72. Single nucleotide variants and small indels were detected with HaplotypeCaller (v3.7, v3.8 or v4.1; three laboratories, 40.0% of cases), Freebayes (v1.2.0, one laboratory, 16.6% of cases) or HaplotypeCaller as well as SAMtools v.0.1.7 (one laboratory, 43.4% of cases)^73,74. Mitochondrial DNA variants were assessed using data from exome sequencing in three laboratories (80% of cases)⁷⁵. Copy number variations were detected using ExomeDepth or ClinCNN on short-read data (two laboratories, 60.0% of cases), before exome sequencing by array CGH (two laboratories, 30.0% of cases) or not evaluated (one laboratory, 10.1%)^76,77. Additionally, analysis for structural variants was only conducted by one laboratory (16.6% of cases). Analysis for uniparental disomy was performed in two sequencing laboratories (60.0% of cases) using the UpdHunter function of ngs-bits v2019_09 (https://github.com/imgag/ngs-bits) or custom scripts. Finally, analyses for mosaic variants were conducted by four laboratories (90% of cases).

Variants were annotated using VEP (four laboratories, 80.2% of cases)⁷⁸ or Jannovar (one laboratory, 19.8% of cases)⁷⁹ and analyzed in VarFish⁸⁰, megaSAP (https://github.com/imgag/megSAP) or EVAdb (https://github.com/mri-ihg/EVAdb) or in tabular format depending on the center. Virtual gene panels were used in four out of five sequencing sites (56.7% of cases). In the sequencing site where no virtual panels were used, a similar approach (HPO-based and Online Mendelian Inheritance in Man (OMIM) full-text search) was used. Additionally, filter parameters specific for assumed modes of inheritances were applied (all laboratories; mainly cutoffs of allele frequencies or counts in the population database gnomAD).

The population background of each individual was estimated with peddy⁸¹. This revealed that the cohort was of predominantly European origin (Supplementary Table 1 and Supplementary Fig. 10).

Autozygosity was estimated using RohHunter, bcftools/roh or a sliding-window framework^82,83,84. A small subset of samples was run on all three tools, and this yielded comparable results for autozygosity. A threshold of 2% was used to assign patients to a high- or a low-autozygosity group¹⁴ (Supplementary Fig. 11).

The variants identified in exome sequencing were assessed in accordance with the standards and guidelines of the ACMG for the interpretation of sequence variants⁸⁵. At least two physicians or experts in molecular genetics participated in the assessment of the variants. Finally, all variants that were potentially disease-causing (pathogenicity class 3–5) and actionable secondary findings were reported to the respective patients.

Cases in which no diagnosis could be established in a known disease-associated gene were included in national and international studies for the discovery of novel disease etiologies for example, via the MatchMaker Exchange network^86,87. Variants with a high likelihood of being disease-causing, for example, those with loss of function or high pathogenicity scores, or those that had arisen de novo, were shared through MatchMaker Exchange or a similar network in order to identify similar patients^88,89.

Statistical analyses

All statistical analyses were conducted in R (version 4.2.2)⁹⁰. Proportions were tested using a two-sided Fisher’s exact test. The significance level was set to α = 0.05, and P values were corrected via Bonferroni correction if necessary.

Visualization of phenotype space using UMAP

First, data on known diseases and their clinical features were downloaded from the HPO website (https://hpo.jax.org/app/download/annotation, file: genes_to_phenotype.txt, downloaded on 10 April 2021). The disease data were merged with the data of the 1,577 individuals from TRANSLATE NAMSE by treating each disease–ID as one individual. Similarities in HPO terms between all pairs of individuals were then calculated using the R package ontologySimilarity (version 2.5). The similarities were then converted to a distance matrix and projected into a four-dimensional space using uniform manifold approximation and projection (UMAP). Subsequently, the first two dimensions of this projection were plotted using ggplot2 (version 3.3.4).

Variants amenable to carrier screening

In cases with autosomal recessive inheritance, disease-causing variants in ClinVar were queried in January 2017 (beginning of the project) to take into account the state of knowledge available at the time of analysis. Variants were classified as amenable to carrier screening if they were classified as pathogenic or likely pathogenic in ClinVar or if they were predicted loss-of-function variants that were not predicted to escape nonsense-mediated messenger RNA decay. In compound-heterozygous inheritance, both variants were required to be (likely) pathogenic.

Comparison of disease-associated genes reported in TRANSLATE NAMSE with those reported in other cohorts

In the German healthcare system, genetic testing of the more frequent rare disorders, for example, retinitis pigmentosa or hearing impairment, is performed using gene panels.

For a comparison with the cohort from the NIHR BioResource described in Turro et al.²⁰, all disease-associated genes were first ranked according to the frequency of submissions of pathogenic and likely pathogenic variants to ClinVar. Disorders caused by genes in the first quartile of the ClinVar gene distribution, such as USH2A, ABCA4 and BMPR2, are more prevalent than phenotypes associated with genes in the fourth quartile. In addition, the year in which phenotype–gene associations had first been reported was determined to assess when a diagnosis could first have been established. The characteristics of the variants identified in the TRANSLATE NAMSE exome sequencing cohort were then compared with those identified in a cohort reported by Turro et al. in 2020.

Turro et al. subjected DNA from 9,802 individuals with a suspected rare disease to genome sequencing and reported pathogenic or likely pathogenic variants in 1,138 cases²⁰. Around a quarter of these variants were assigned to genes with a high disease prevalence (Supplementary Fig. 4). In contrast, most disease-associated genes identified in the TRANSLATE NAMSE cohort were ultrarare, and more frequent diagnoses were underrepresented.

Novel disease candidate genes

Sequence data from the unsolved cases were analyzed for variants in potential novel disease candidate genes. The following mandatory criteria for novel disease candidate genes were defined: (1) the gene had shown no previous robust association with any human phenotype; (2) no other clearly causative disease explanation was found; (3) the allele frequency of the respective variant was below the minor allele frequency cutoff or the variant was absent in controls; (4) inheritance was in accordance with the phenotype in the family and/or the variant co-segregated with the disease in multiple affected family members. As in the ClinGen approach and as suggested by others, characteristics, including gnomAD constraint metrics, inheritance and functional data, by which the level of evidence for the manually identified candidate genes could be assessed were defined^61,91,92 (Supplementary Table 3). An evidence score was then calculated, which could reach a maximum value of 8. Three of the nine criteria can only be applied to genes with an autosomal dominant mode of inheritance (de novo status and gnomAD constraint metrics), rendering the score less informative for autosomal recessive inheritance. For autosomal dominant inheritance, a score of 1–3 was ranked as medium evidence and a score of 4 and above as high evidence. For recessive inheritance, a score of 3 or above was ranked as high evidence and a score of below 3 was ranked as medium evidence. Genes first published as disease-associated during the course of TRANSLATE NAMSE were classified as novel DGG.

Diagnostic yield prediction (YieldPred)

The TRANSLATE NAMSE exome sequencing cohort (n = 1,577) was randomly divided into a training set comprising 1,256 cases (399 solved, 32%) and a test set comprising 321 cases (99 solved, 31%). The binary status of a case (1, solved; 0, unsolved) was regressed on the 49 HPO-defined subcategories (cf. clinical and laboratory phenotype data) using LASSO for binary outcomes with the logit function as a link function (R package glmnet, version 4.1-4) and by controlling for age (adult/child), sex (male/female), sequencing laboratory and the use of the PEDIA workflow. Variable selection was applied on the 49 HPO-defined subcategories only. The model was fitted on the training set, and the penalty parameter was tuned via tenfold cross-validation. The resulting model was then applied to the test set, and its predictive performance was evaluated using the receiver operator characteristics curve.

We further validated the influence of the separate HPO terms on the model. Figure 5 shows the resulting coefficient plot and was checked for plausibility. We found a positive correlation between the number of HPO terms and the predicted probability on the complete TRANSLATE NAMSE exome sequencing cohort (n = 1,577; Supplementary Fig. 6). Since the approach of HPO-defined subcategories ensures that multiple lower-order terms are only counted once, this finding indicates that a monogenic cause and diagnosis via exome sequencing is more likely if a patient exhibits a diverse set of clinical features. Furthermore, we investigated the discriminatory power of all 1,649 unique HPO terms that were annotated in the TRANSLATE NAMSE cohort. Considering each HPO term separately to discriminate between solved and unsolved patients led to an average AUC of 0.5 (s.d. 0.003), that is, no discriminatory power. The maximum achieved AUC of a single HPO term, namely HP:0001263 (global developmental delay), was 0.58. As a sensitivity analysis, we then fitted a logistic regression on the complete TNAMSE cohort with the top five HPO terms, namely HP:0001263 (global developmental delay), HP:0000252 (microcephaly), HP:0001252 (hypotonia), HP:0001250 (seizure) and HP:0001251 (ataxia), and achieved an AUC of 0.64 (95% CI 0.61–0.67). On the complete TNAMSE set (that is, training and test set combined) our YieldPred model yielded an AUC of 0.72 (95% CI 0.69–0.74). In summary, there are some HPO terms that have higher discriminatory power than the majority of the HPO terms. However, the signal of YieldPred is additionally driven by the combination of multiple phenotypic features that are present in a patient.

To increase the portability and applicability of the Lasso model, two additional external and independent cohorts were included. This first external cohort (n = 753, 545 solved, 72%; Supplementary Table 5) was recruited by the Technical University of Munich, and all individuals consented in the scientific use of their phenotype and genotype data. As a second external cohort, we used the NIHR BioResource cohort described by Turro et al. (n = 5,510, 1,059 solved, 19%). The Lasso model was then retrained on cases of all three cohorts and 20% of the cases of each cohort were kept as hold-out test set. The AUCs of the final model ranged from 0.64 for the TRANSLATE NAMSE cases of the test set and 0.65 for the Munich cases of the test set to 0.71 for the cases of the test set from the cohort of Turro et al. (Supplementary Fig. 5). The final model was provided as the tool YieldPred as a web service, where users can specify the age, sex and assigned HPO terms of their patient, while the remaining confounders are estimated via the mean confounder values of the training cohort.

PEDIA analysis

PEDIA integrated the facial image and clinical feature analysis with exome data analysis⁴¹. For each patient, a frontal facial image, clinical features encoded in HPO terminology, and exome sequencing data were available for analysis.

The PEDIA approach was used, in which the facial image analysis was analyzed by GestaltMatcher⁴⁰. GestaltMatcher was trained on 6,354 frontal images with 204 different disorders to learn the respective facial dysmorphic features, and it further encoded each image into a 512-dimensional facial phenotype descriptor. The model ensembles and test-time augmentation were later used to generate 12 512-dimensional facial phenotype descriptors for each image⁹³. The similarity between two patients can be quantified by averaging 12 cosine distances of the facial phenotype descriptors. For each test image, a list of similarity scores for 816 disease-causing genes were obtained. To convert HPO terms of individual patients into feature scores for each gene, the CADA approach was used⁶⁹. For the exome data, each variant was annotated with a version 1.6 CADD score⁴². After filtering out the common variants, the highest CADD score for each gene was taken.

In this analysis, benchmarking was performed on two cohorts: the PEDIA subcohort and the validation cohort. The PEDIA subcohort consisted of a subset of 224 of the 1,577 exome sequencing patients (194 pediatric, 30 adult). Of these, 94 had a molecular genetic diagnosis (86 pediatric, 8 adult). After the end of the 3-year TRANSLATE NAMSE recruitment period, a further 149 patients were enrolled and used as a validation cohort. In the validation cohort, 69 out of 149 patients were solved cases. All facial images analyzed in the present study can be accessed in GestaltMatcher Database (https://db.gestaltmatcher.org/) by the GMDB ID in Supplementary Tables 1 and 6. For each patient, each gene had a GestaltMatcher score, a CADA score and a CADD score. These three scores were the input of the PEDIA approach. The output for each patient was a list of genes, and each gene had a PEDIA score. The genes were then prioritized by ranking the PEDIA scores in descending order. To benchmark the performance, top-k accuracy was used, as calculated by the percentage of the patients with the disease-causing gene ranked in the top-k position. Finally, the top-1 to top-100 accuracies of the two cohorts (the PEDIA subcohort of the exome sequencing cohort and validation cohort) were reported.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The corresponding author agrees to fulfill any requests for materials not included in the article, subject to verification that the request adheres to the consent provided by the research participants. Patient-related data not included in the article may be subject to patient confidentiality. Raw sequencing data were not consented for sharing, except for the PEDIA subset, which is available upon request. Reported alleles and their clinical interpretation have been deposited in ClinVar using the following submitters: Institute for Genomic Statistics and Bioinformatics (University Hospital Bonn) (https://www.ncbi.nlm.nih.gov/clinvar/submitters/507028/, https://www.ncbi.nlm.nih.gov/clinvar/submitters/508040/); Institute of Human Genetics, Klinikum rechts der Isar (Technical University Munich) (https://www.ncbi.nlm.nih.gov/clinvar/submitters/500240/); Institute for Medical Genetics and Human Genetics (Charité – Universitätsmedizin Berlin) (https://www.ncbi.nlm.nih.gov/clinvar/submitters/505735/); Institute of Medical Genetics and Applied Genomics (University Hospital Tübingen) (https://www.ncbi.nlm.nih.gov/clinvar/submitters/506385/); and Genomics Facility (Ludwig-Maximilians-Universität München) (https://www.ncbi.nlm.nih.gov/clinvar/submitters/507363/).

Code availability

The study’s landing page (https://www.translate-namse.de) redirects to a web service for the prediction of the diagnostic yield and the code repository at GitHub (https://github.com/Ax-Sch/TNAMSE_geno_pheno). Code is also available via Zenodo at https://doi.org/10.5281/zenodo.10964188 (ref. ⁹⁴). All source codes are available under a creative commons license.

References

Nguengang Wakap, S. et al. Estimating cumulative point prevalence of rare diseases: analysis of the Orphanet database. Eur. J. Hum. Genet. 28, 165–173 (2020).
Article PubMed Google Scholar
Blöß, S. et al. Diagnostic needs for rare diseases and shared prediagnostic phenomena: results of a German-wide expert Delphi survey. PLoS ONE 12, e0172532 (2017).
Article PubMed PubMed Central Google Scholar
Boycott, K. M. et al. International cooperation to enable the diagnosis of all rare genetic diseases. Am. J. Hum. Genet. 100, 695–705 (2017).
Article CAS PubMed PubMed Central Google Scholar
Austin, C. P. et al. Future of rare diseases eesearch 2017–2027: an IRDiRC Perspective. Clin. Transl. Sci. 11, 21–27 (2018).
Article PubMed Google Scholar
Hochstenbach, R. et al. Array analysis and karyotyping: workflow consequences based on a retrospective study of 36,325 patients with idiopathic developmental delay in the Netherlands. Eur. J. Med. Genet. 52, 161–169 (2009).
Article PubMed Google Scholar
Choi, H. S. et al. Molecular diagnosis of hereditary spherocytosis by multi-gene target sequencing in Korea: matching with osmotic fragility test and presence of spherocyte. Orphanet J. Rare Dis. 14, 114 (2019).
Article PubMed PubMed Central Google Scholar
Kochinke, K. et al. Systematic phenomics analysis deconvolutes genes mutated in intellectual disability into biologically coherent modules. Am. J. Hum. Genet. 98, 149–164 (2016).
Article CAS PubMed PubMed Central Google Scholar
100,000 Genomes Project Pilot Investigatorset al. 100,000 Genomes pilot on rare-disease diagnosis in health care—preliminary report. N. Engl. J. Med. 385, 1868–1880 (2021).
Article Google Scholar
Rillig, F., Grüters, A., Schramm, C. & Krude, H. The interdisciplinary diagnosis of rare diseases: results of the TRANSLATE-NAMSE project. Dtsch. Arztebl. Int. 119, 469–475 (2022).
PubMed PubMed Central Google Scholar
Cao, Y. et al. A clinical survey of mosaic single nucleotide variants in disease-causing genes detected by exome sequencing. Genome Med. 11, 48 (2019).
Article PubMed PubMed Central Google Scholar
Gambin, T. et al. Low-level parental somatic mosaic SNVs in exomes from a large cohort of trios with diverse suspected Mendelian conditions. Genet. Med. 22, 1768–1776 (2020).
Article CAS PubMed PubMed Central Google Scholar
Wright, C. F. et al. Clinically-relevant postzygotic mosaicism in parents and children with developmental disorders in trio exome sequencing data. Nat. Commun. 10, 2985 (2019).
Article CAS PubMed PubMed Central Google Scholar
Landrum, M. J. et al. ClinVar: improvements to accessing data. Nucleic Acids Res. 48, D835–D844 (2020).
Article CAS PubMed Google Scholar
Martin, H. C. et al. Quantifying the contribution of recessive coding variation to developmental disorders. Science 362, 1161–1164 (2018).
Article CAS PubMed PubMed Central Google Scholar
Fridman, H. et al. The landscape of autosomal-recessive pathogenic variants in European populations reveals phenotype-specific effects. Am. J. Hum. Genet. 108, 608–619 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hu, H. et al. Genetics of intellectual disability in consanguineous families. Mol. Psychiatry 24, 1027–1039 (2019).
Article CAS PubMed Google Scholar
La Rocca, L. A. et al. Understanding recessive disease risk in multi-ethnic populations with different degrees of consanguinity. Am. J. Med. Genet. A 194, e63452 (2024).
Article PubMed Google Scholar
Posey, J. E. et al. Resolution of disease phenotypes resulting from multilocus genomic variation. N. Engl. J. Med. 376, 21–31 (2017).
Article CAS PubMed Google Scholar
Mitani, T. et al. High prevalence of multilocus pathogenic variation in neurodevelopmental disorders in the Turkish population. Am. J. Hum. Genet. 108, 1981–2005 (2021).
Article CAS PubMed PubMed Central Google Scholar
Turro, E. et al. Whole-genome sequencing of patients with rare diseases in a national health system. Nature 583, 96–102 (2020).
Article CAS PubMed PubMed Central Google Scholar
Körholz, J. et al. Novel mutation and expanding phenotype in IRF2BP2 deficiency. Rheumatology 62, 1699–1705 (2023).
Article PubMed Google Scholar
Mochel, F. et al. Variants in the SK2 channel gene (KCNN2) lead to dominant neurodevelopmental movement disorders. Brain 143, 3564–3573 (2020).
Article PubMed Google Scholar
Magg, T. et al. Heterozygous OAS1 gain-of-function variants cause an autoinflammatory immunodeficiency. Sci. Immunol. 6, eabf9564 (2021).
Article CAS PubMed PubMed Central Google Scholar
den Hoed, J. et al. Mutation-specific pathophysiological mechanisms define different neurodevelopmental disorders associated with SATB1 dysfunction. Am. J. Hum. Genet. 108, 346–356 (2021).
Article Google Scholar
Li, D. et al. Pathogenic variants in SMARCA5, a chromatin remodeler, cause a range of syndromic neurodevelopmental features. Sci. Adv. 7, eabf2066 (2021).
Article CAS PubMed PubMed Central Google Scholar
Thaventhiran, J. E. D. et al. Whole-genome sequencing of a sporadic primary immunodeficiency cohort. Nature 583, 90–95 (2020).
Article CAS PubMed PubMed Central Google Scholar
Vogt, G. et al. Biallelic truncating variants in ATP9A cause a novel neurodevelopmental disorder involving postnatal microcephaly and failure to thrive. J. Med. Genet. 59, 662–668 (2022).
Article CAS PubMed Google Scholar
Stenton, S. L. et al. Impaired complex I repair causes recessive Leber’s hereditary optic neuropathy. J. Clin. Invest. 131, e138267 (2021).
Article CAS PubMed PubMed Central Google Scholar
Horn, D. et al. Biallelic truncating variants in MAPKAPK5 cause a new developmental disorder involving neurological, cardiac, and facial anomalies combined with synpolydactyly. Genet. Med. 23, 679–688 (2021).
Article CAS PubMed Google Scholar
Brugger, M. et al. A homozygous truncating variant in CCDC186 in an individual with epileptic encephalopathy. Ann. Clin. Transl. Neurol. 8, 278–283 (2021).
Article CAS PubMed Google Scholar
Marafi, D. et al. A reverse genetics and genomics approach to gene paralog function and disease: Myokymia and the juxtaparanode. Am. J. Hum. Genet. 109, 1713–1723 (2022).
Article CAS PubMed PubMed Central Google Scholar
Ebstein, F. et al. PSMC3 proteasome subunit variants are associated with neurodevelopmental delay and type I interferon production. Sci. Transl. Med. 15, eabo3189 (2023).
Article CAS PubMed PubMed Central Google Scholar
Richard, E. M. et al. Bi-allelic variants in SPATA5L1 lead to intellectual disability, spastic-dystonic cerebral palsy, epilepsy, and hearing loss. Am. J. Hum. Genet. 108, 2006–2016 (2021).
Article CAS PubMed PubMed Central Google Scholar
Liu, Z. et al. Hemizygous variants in protein phosphatase 1 regulatory subunit 3F (PPP1R3F) are associated with a neurodevelopmental disorder characterized by developmental delay, intellectual disability and autistic features. Hum. Mol. Genet. 32, 2981–2995 (2023).
Article CAS PubMed PubMed Central Google Scholar
Aref-Eshghi, E. et al. Genomic DNA methylation signatures enable concurrent diagnosis and clinical genetic variant classification in neurodevelopmental syndromes. Am. J. Hum. Genet. 102, 156–174 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mirza-Schreiber, N. et al. Blood DNA methylation provides an accurate biomarker of KMT2B-related dystonia and predicts onset. Brain 145, 644–654 (2022).
Article PubMed Google Scholar
Cummings, B. B. et al. Improving genetic diagnosis in Mendelian disease with transcriptome sequencing. Sci. Transl. Med. 9, eaal5209 (2017).
Article PubMed PubMed Central Google Scholar
Murdock, D. R. et al. Transcriptome-directed analysis for Mendelian disease diagnosis overcomes limitations of conventional genomic testing. J. Clin. Invest. 131, e141500 (2021).
Article CAS PubMed PubMed Central Google Scholar
Frésard, L. et al. Identification of rare-disease genes using blood transcriptome sequencing and large control cohorts. Nat. Med. 25, 911–919 (2019).
Article PubMed PubMed Central Google Scholar
Hsieh, T.-C. et al. GestaltMatcher facilitates rare disease matching using facial phenotype descriptors. Nat. Genet. 54, 349–357 (2022).
Article CAS PubMed PubMed Central Google Scholar
Hsieh, T.-C. et al. PEDIA: prioritization of exome data by image analysis. Genet. Med. 21, 2807–2814 (2019).
Article PubMed PubMed Central Google Scholar
Kircher, M. et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 46, 310–315 (2014).
Article CAS PubMed PubMed Central Google Scholar
Robinson, P. N. et al. Improved exome prioritization of disease genes through cross-species phenotype comparison. Genome Res. 24, 340–348 (2014).
Article CAS PubMed PubMed Central Google Scholar
Li, Q., Zhao, K., Bustamante, C. D., Ma, X. & Wong, W. H. Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis. Genet. Med. 21, 2126–2134 (2019).
Article PubMed PubMed Central Google Scholar
Robinson, P. N. et al. Interpretable clinical genomics with a likelihood ratio paradigm. Am. J. Hum. Genet. 107, 403–417 (2020).
Article CAS PubMed PubMed Central Google Scholar
Birgmeier, J. et al. AMELIE speeds Mendelian diagnosis by matching patient phenotype and genotype to primary literature. Sci. Transl. Med. 12, eaau9113 (2020).
Article PubMed PubMed Central Google Scholar
Brand, F. et al. Next-generation phenotyping contributing to the identification of a 4.7 kb deletion in KANSL1 causing Koolen-de Vries syndrome. Hum. Mutat. 43, 1659–1665 (2022).
Article CAS PubMed Google Scholar
Bick, D. et al. An online compendium of treatable genetic disorders. Am. J. Med. Genet. C 187, 48–54 (2021).
Article Google Scholar
Capotondo, A. et al. Safety of arylsulfatase A overexpression for gene therapy of metachromatic leukodystrophy. Hum. Gene Ther. 18, 821–836 (2007).
Article CAS PubMed Google Scholar
Feichtinger, R. G. et al. A spoonful of L-fucose-an efficient therapy for GFUS-CDG, a new glycosylation disorder. EMBO Mol. Med. 13, e14332 (2021).
Article CAS PubMed PubMed Central Google Scholar
Tambuyzer, E. et al. Therapies for rare diseases: therapeutic modalities, progress and challenges ahead. Nat. Rev. Drug Discov. 19, 93–111 (2020).
Article CAS PubMed Google Scholar
Stark, Z. et al. Prospective comparison of the cost-effectiveness of clinical whole-exome sequencing with that of usual care overwhelmingly supports early use and reimbursement. Genet. Med. 19, 867–874 (2017).
Article PubMed Google Scholar
Retterer, K. et al. Clinical application of whole-exome sequencing across clinical indications. Genet. Med. 18, 696–704 (2016).
Article CAS PubMed Google Scholar
Kingsmore, S. F. et al. A randomized, controlled trial of the analytic and diagnostic performance of singleton and trio, rapid genome and exome sequencing in ill infants. Am. J. Hum. Genet. 105, 719–733 (2019).
Article CAS PubMed PubMed Central Google Scholar
Benito-Lozano, J. et al. Diagnostic process in rare diseases: determinants associated with diagnostic delay. Int. J. Environ. Res. Public Health 19, 6456 (2022).
Article PubMed PubMed Central Google Scholar
Benito-Lozano, J., López-Villalba, B., Arias-Merino, G., Posada de la Paz, M. & Alonso-Ferreira, V. Diagnostic delay in rare diseases: data from the Spanish rare diseases patient registry. Orphanet J. Rare Dis. 17, 418 (2022).
Article PubMed PubMed Central Google Scholar
Illert, A. L. et al. The german network for personalized medicine to enhance patient care and translational research. Nat. Med. 29, 1298–1301 (2023).
Article CAS PubMed Google Scholar
Kaplanis, J. et al. Evidence for 28 genetic disorders discovered by combining healthcare and research data. Nature 586, 757–762 (2020).
Article CAS PubMed PubMed Central Google Scholar
Wright, C. F. et al. Evaluating variants classified as pathogenic in ClinVar in the DDD Study. Genet. Med. 23, 571–575 (2021).
Article CAS PubMed Google Scholar
Wright, C. F. et al. Making new genetic diagnoses with old data: iterative reanalysis and reporting from genome-wide data in 1,133 families with developmental disorders. Genet. Med. 20, 1216–1223 (2018).
Article PubMed PubMed Central Google Scholar
MacArthur, D. G. et al. Guidelines for investigating causality of sequence variants in human disease. Nature 508, 469–476 (2014).
Article CAS PubMed PubMed Central Google Scholar
Gao, Z., Waggoner, D., Stephens, M., Ober, C. & Przeworski, M. An estimate of the average number of recessive lethal mutations carried by humans. Genetics 199, 1243–1254 (2015).
Article PubMed PubMed Central Google Scholar
Narasimhan, V. M. et al. Health and population effects of rare gene knockouts in adult humans with related parents. Science 352, 474–477 (2016).
Article CAS PubMed PubMed Central Google Scholar
Chakraborty, R. & Chakravarti, A. On consanguineous marriages and the genetic load. Hum. Genet. 36, 47–54 (1977).
Article CAS PubMed Google Scholar
La Rocca, L. A. et al. Understanding recessive disease risk in multi-ethnic populations with different degrees of consanguinity. Am. J. Med. Genet. A 194, e63452 (2024).
Article PubMed Google Scholar
Antonarakis, S. E. Carrier screening for recessive disorders. Nat. Rev. Genet. 20, 549–561 (2019).
Article CAS PubMed Google Scholar
Kalia, S. S. et al. Recommendations for reporting of secondary findings in clinical exome and genome sequencing, 2016 update (ACMG SF v2.0): a policy statement of the American College of Medical Genetics and Genomics. Genet. Med. 19, 249–255 (2017).
Article PubMed Google Scholar
Rentzsch, P., Schubach, M., Shendure, J. & Kircher, M. CADD-splice-improving genome-wide variant effect prediction using deep learning-derived splice scores. Genome Med. 13, 31 (2021).
Article CAS PubMed PubMed Central Google Scholar
Peng, C. et al. CADA: phenotype-driven gene prioritization based on a case-enriched knowledge graph. NAR Genom. Bioinform. 3, lqab078 (2021).
Article PubMed PubMed Central Google Scholar
Choukair, D. et al. An Integrated clinical pathway for diagnosis, treatment and care of rare diseases: model, operating procedures, and results of the project TRANSLATE-NAMSE funded by the German Federal Joint Committee. Orphanet J. Rare Dis. 16, 474 (2021).
Article PubMed PubMed Central Google Scholar
Vasimuddin, M., Misra, S., Li, H. & Aluru, S. Efficient Architecture-Aware Acceleration of BWA-MEM for Multicore Systems. In 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 314–324 (IPDPS, 2019).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).
Article CAS PubMed PubMed Central Google Scholar
Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993 (2011).
Article CAS PubMed PubMed Central Google Scholar
Wagner, M. et al. Mitochondrial DNA mutation analysis from exome sequencing—a more holistic approach in diagnostics of suspected mitochondrial disease. J. Inherit. Metab. Dis. 42, 909–917 (2019).
Article CAS PubMed Google Scholar
Ye, K. et al. Split-read indel and structural variant calling using PINDEL. Methods Mol. Biol. 1833, 95–105 (2018).
Article CAS PubMed Google Scholar
Plagnol, V. et al. A robust model for read count data in exome sequencing experiments and implications for copy number variant calling. Bioinformatics 28, 2747–2754 (2012).
Article CAS PubMed PubMed Central Google Scholar
McLaren, W. et al. The ensembl variant effect predictor. Genome Biol. 17, 122 (2016).
Article PubMed PubMed Central Google Scholar
Jäger, M. et al. Jannovar: a java library for exome annotation. Hum. Mutat. 35, 548–555 (2014).
Article PubMed Google Scholar
Holtgrewe, M. et al. VarFish: comprehensive DNA variant analysis for diagnostics and research. Nucleic Acids Res. 48, W162–W169 (2020).
Article CAS PubMed PubMed Central Google Scholar
Pedersen, B. S. & Quinlan, A. R. Who’s who? Detecting and resolving sample anomalies in human DNA sequencing studies with Peddy. Am. J. Hum. Genet. 100, 406–413 (2017).
Article CAS PubMed PubMed Central Google Scholar
Pemberton, T. J. et al. Genomic patterns of homozygosity in worldwide human populations. Am. J. Hum. Genet. 91, 275–292 (2012).
Article CAS PubMed PubMed Central Google Scholar
Wang, S., Haynes, C., Barany, F. & Ott, J. Genome-wide autozygosity mapping in human populations. Genet. Epidemiol. 33, 172–180 (2009).
Article PubMed PubMed Central Google Scholar
Narasimhan, V. et al. BCFtools/RoH: a hidden Markov model approach for detecting autozygosity from next-generation sequencing data. Bioinformatics 32, 1749–1751 (2016).
Article CAS PubMed PubMed Central Google Scholar
Richards, S. et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 17, 405–424 (2015).
Article PubMed PubMed Central Google Scholar
Philippakis, A. A. et al. The MatchMaker Exchange: a platform for rare disease gene discovery. Hum. Mutat. 36, 915–921 (2015).
Article PubMed PubMed Central Google Scholar
Sobreira, N. L. M. et al. MatchMaker Exchange. Curr. Protoc. Hum. Genet. 95, 9.31.1–9.31.15 (2017).
PubMed Google Scholar
Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020).
Article CAS PubMed PubMed Central Google Scholar
Samocha, K. E. et al. A framework for the interpretation of de novo mutation in human disease. Nat. Genet. 46, 944–950 (2014).
Article CAS PubMed PubMed Central Google Scholar
R Core Team. R: a language and environment for statistical computing. R Project https://www.R-project.org/ (2021).
Lieberwirth, J. et al. AutoCaSc: prioritizing candidate genes for neurodevelopmental disorders. Hum. Mutat. 43, 1795–1807 (2022).
Article PubMed Google Scholar
Strande, N. T. et al. Evaluating the clinical validity of gene–disease associations: an evidence-based framework developed by the Clinical Genome Resource. Am. J. Hum. Genet. 100, 895–906 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hustinx, A. et al. Improving deep facial phenotyping for ultra-rare disorder verification using model ensembles. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (IEEE, 2023).
Schmidt, A. Code used for the analysis of the TRANSLATE-NAMSE data. Zenodo https://doi.org/10.5281/zenodo.10964188 (2024).

Download references

Acknowledgements

We thank all patients and families from TRANSLATE NAMSE and NIHR BioResource for their cooperation. We thank C. Schmael for proofreading of the manuscript. M.D., H.L.S. and M.A.M. are participants in the BIH Charité (Digital/Junior) Clinician Scientist Program, which is funded by Charité – Universitätsmedizin Berlin and the Berlin Institute of Health (BIH). F. Boschann is a participant in the Clinician Scientist Program (CS4RARE) funded by the Alliance4Rare and associated to the BIH Charité Clinician Scientist Program. A.S. was supported by the BONFOR program of the Medical Faculty, University of Bonn (O-149.0134). M.A.L.-K. received funding from DFG (CRC237 369799452/B21 and CRC237 369799452/A11). C. Schlein received funding from DFG (SCHL2276/2-1; 450149205-TRR333/1). E.T. was funded by NIH awards R01HL161365 and R03HD111492.

Author information

These authors contributed equally: Axel Schmidt, Magdalena Danyel, Kathrin Grundmann, Theresa Brunet.
These authors jointly supervised this work: Peter M. Krawitz, Tobias Haack, Nadja Ehmke, Matias Wagner.

Authors and Affiliations

Institute of Human Genetics, University of Bonn, Medical Faculty and University Hospital Bonn, Bonn, Germany
Axel Schmidt, Hartmut Engels, Sophia Peters, Sugirthan Sivalingam, Claudia Perne, Elisabeth Mangold, Martina Kreiss, Kirsten Cremer, Regina C. Betz, Isabel Spier, André Heimbach, Hellen Lesmann, Sheetal Kumar & Markus Nöthen
Institute for Medical Genetics and Human Genetics, Charité – Universitätsmedizin Berlin, Berlin, Germany
Magdalena Danyel, Felix Boschann, Henrike Lisa Sczakiel, Sarina Schwartzmann, Martin Atta Mensah, Jean Tori Pantel, Uwe Kornak, Claus-Eric Ott, Markus Schülke, Denise Horn, Stefan Mundlos & Nadja Ehmke
BIH Charité Clinician Scientist Program, Berlin Institute of Health at Charité – Universitätsmedizin Berlin, Berlin, Germany
Magdalena Danyel, Felix Boschann, Henrike Lisa Sczakiel, Martin Atta Mensah & Nadja Ehmke
Institute for Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany
Kathrin Grundmann, Martin Kehrer, German Demidov, Stefanie Beck-Wödl, Marc Sturm, Olaf Riess & Tobias Haack
Institute of Human Genetics, Klinikum rechts der Isar, School of Medicine, Technical University of Munich, München, Germany
Theresa Brunet, Dominik S. Westphal, Tim Strom, Reka Kovacs, Korbinian M. Riedhammer, Katharina Mayerhanser, Elisabeth Graf, Melanie Brugger, Julia Hoefele, Ulrich Schatz, Martin Krenn, Thomas Meitinger & Matias Wagner
Institute for Genomic Statistics and Bioinformatics, University of Bonn, Medical Faculty and University Hospital Bonn, Bonn, Germany
Hannah Klinkhammer, Tzung-Chien Hsieh, Alexej Knaus, Fabian Brand, Meghna Ahuja Basin, Pietro Incardona & Peter M. Krawitz
Institut für Medizinische Biometrie, Informatik und Epidemiologie, University of Bonn, Medical Faculty and University Hospital Bonn, Bonn, Germany
Hannah Klinkhammer
Institute for Medical Genetics, Stellenbosch University, Cape Town, South Africa
Shahida Moosa
Department of Pediatrics, University Hospital Düsseldorf, Düsseldorf, Germany
Luisa Averdunk
Institute for Human Genetics and Genomic Medicine, Medical Faculty, Uniklinik RWTH Aachen University, Aachen, Germany
Jean Tori Pantel, Ingo Kurth, Miriam Elbracht, Cordula Knopp, Matthias Begemann & Florian Kraft
Core Uni Bioinformatics, Berlin Institute of Health at Charité – Universitätsmedizin Berlin, Berlin, Germany
Manuel Holtgrewe
Department of Pediatrics, Charité – Universitätsmedizin Berlin, Berlin, Germany
Annemarie Bösch, Claudia Weiß, Natalie Weinhold, Aude-Annick Suter, Corinna Stoltenburg, Julia Neugebauer, Tillmann Kallinich, Susanne Holzhauer, Christoph Bührer & Philip Bufler
Department of Pediatric Neurology, Charité – Universitätsmedizin Berlin, Berlin, Germany
Angela M. Kaindl
Center for Chronically Sick Children, Charité – Universitätsmedizin Berlin, Berlin, Germany
Angela M. Kaindl
Institute of Cell and Neurobiology, Charité – Universitätsmedizin Berlin, Berlin, Germany
Angela M. Kaindl
Department of Human Genetics, Ruhr University Bochum, Bochum, Germany
Hoa Huu Phuc Nguyen & Sabine Hoffjan
Department of Pediatrics Bochum and CeSER, Ruhr University Bochum, Bochum, Germany
Corinna Grasemann, Tobias Rothoeft, Folke Brinkmann & Nora Matar
Center for Rare Diseases, University of Bonn, Medical Faculty and University Hospital Bonn, Bonn, Germany
Martin Mücke, Lorenz Grigull, Tim Bender, Christiane Stieber, Alexandra Marzena Morawiec & Sarah Bernsen
Department of Neurology, University of Bonn, Medical Faculty and University Hospital Bonn, Bonn, Germany
Thomas Klockgether, Patrick Weydt, Sergio Castro-Gomez, Ahmad Aziz, Marcus Grobe-Einsler, Okka Kimmich, Xenia Kobeleva, Demet Önder & Pawel Tacik
Clinic for Internal Medicine III, University of Bonn, Medical Faculty and University Hospital Bonn, Bonn, Germany
Pantelis Karakostas & Valentin S. Schäfer
University Center for Rare Diseases, University Hospital Carl Gustav Carus, Dresden, Germany
Min Ae Lee-Kirsch, Reinhard Berner, Catharina Schuetz, Julia Körholz, Tanita Kretschmer, Nataliya Di Donato, Evelin Schröck, André Heinen, Ulrike Reuner & Amalia-Mihaela Hanßke
Department of Pediatrics, University Hospital Carl Gustav Carus, Dresden, Germany
Min Ae Lee-Kirsch, Reinhard Berner, Catharina Schuetz, Julia Körholz, Tanita Kretschmer & André Heinen
Institute for Clinical Genetics, University Hospital Carl Gustav Carus, Dresden, Germany
Nataliya Di Donato & Evelin Schröck
Department of Neurology, University Hospital Carl Gustav Carus, Dresden, Germany
Ulrike Reuner
Institute of Human Genetics, University Hospital Essen, Essen, Germany
Frank J. Kaiser, Martin Munteanu & Alma Kuechler
Department of Pediatrics II, University Hospital Essen, Essen, Germany
Eva Manka, Kiewert Cordula & Raphael Hirtz
Department of Neurology, University Hospital Halle, Halle, Germany
Elena Schlapakow
Institute of Human Genetics, University Hospital Hamburg-Eppendorf, Hamburg, Germany
Christian Schlein, Jasmin Lisfeld, Christian Kubisch, Theresia Herget & Maja Hempel
Martin Zeitz Center for Rare Diseases, University Hospital Hamburg-Eppendorf, Hamburg, Germany
Christian Kubisch, Maja Hempel, Christina Weiler-Normann, Kurt Ullrich, Christoph Schramm, Cornelia Rudolph, Franziska Rillig & Maximilian Groffmann
Institute of Human Genetics, Heidelberg University, Heidelberg, Germany
Maja Hempel, Alexandra Tibelius, Eva M. C. Schwaibold, Christian P. Schaaf, Michal Zawada, Lilian Kaufmann & Katrin Hinderhofer
I. Department of Medicine, University Hospital Hamburg-Eppendorf, Hamburg, Germany
Christina Weiler-Normann & Christoph Schramm
Department of Pediatrics, University Hospital Hamburg-Eppendorf, Hamburg, Germany
Ania Muntau
Center for Child and Adolescent Medicine, University Hospital Heidelberg, Heidelberg, Germany
Pamela M. Okun, Urania Kotzaeridou, Georg F. Hoffmann, Daniela Choukair & Markus Bettendorf
Institute of Human Genetics, University Hospital Schleswig-Holstein, Lübeck, Germany
Malte Spielmann & Irina Hüning
Center for Rare Diseases, University Hospital Schleswig-Holstein, Lübeck, Germany
Annekatrin Ripke, Alexander Münchau, Tobias Bäumer & Rebecca Herzog
Department of Neurology, University Hospital Schleswig-Holstein, Lübeck, Germany
Martje Pauly & Rebecca Herzog
Institute for Neurogenetics, University Hospital Schleswig-Holstein, Lübeck, Germany
Martje Pauly
Institute of Systems Motor Science, University of Lübeck, Lübeck, Germany
Alexander Münchau & Tobias Bäumer
Institute of Neurogenetics, University of Lübeck, Lübeck, Germany
Katja Lohmann
Institute of Human Genetics, University of Lübeck, Lübeck, Germany
Britta Hanker
Department of Human Genetics, University Hospital Schleswig-Holstein, Lübeck, Germany
Yorck Hellenbroich
Department of Nephrology, Klinikum rechts der Isar, School of Medicine, Technical University of Munich, München, Germany
Korbinian M. Riedhammer
Institute of Neurogenomics, Helmholtz Zentrum München, München, Germany
Konrad Oexle, Nazanin Mirza-Schreiber, Riccardo Berutti & Matias Wagner
Department of Neurology, Medical University of Vienna, Wien, Austria
Martin Krenn
Department of Paediatrics, Adolescent Medicine and Neonatology, München, Germany
Christine Makowski
Dr. von Hauner Children’s Hospital, University Hospital Munich, München, Germany
Heike Weigand, Sebastian Schröder, Meino Rohlfs, Katharina Vill, Fabian Hauck, Ingo Borggraefe, Wolfgang Müller-Felber, Christoph Klein & Matias Wagner
Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany
Johannes R. Lemke, Julia Hentschel, Konrad Platzer, Vincent Strehlow & Rami Abou Jamra
Center for Rare Diseases, University of Leipzig Medical Center, Leipzig, Germany
Johannes R. Lemke
Center for Rare Diseases, University of Tübingen, Tübingen, Germany
Holm Graessner, Lena Zeltner & Janine Magg
Department of Neurology, University of Tübingen, Tübingen, Germany
Ludger J. Schöls
Department of Pediatric Neurology and Developmental Medicine, University of Tübingen, Tübingen, Germany
Andrea Bevot, Christiane Kehrer & Nadja Kaiser
Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Ernest Turro
Berlin Centre for Rare Diseases, Charité – Universitätsmedizin Berlin, Berlin, Germany
Annette Grüters-Kieslich & Heiko Krude

Authors

Axel Schmidt
View author publications
You can also search for this author in PubMed Google Scholar
Magdalena Danyel
View author publications
You can also search for this author in PubMed Google Scholar
Kathrin Grundmann
View author publications
You can also search for this author in PubMed Google Scholar
Theresa Brunet
View author publications
You can also search for this author in PubMed Google Scholar
Hannah Klinkhammer
View author publications
You can also search for this author in PubMed Google Scholar
Tzung-Chien Hsieh
View author publications
You can also search for this author in PubMed Google Scholar
Hartmut Engels
View author publications
You can also search for this author in PubMed Google Scholar
Sophia Peters
View author publications
You can also search for this author in PubMed Google Scholar
Alexej Knaus
View author publications
You can also search for this author in PubMed Google Scholar
Shahida Moosa
View author publications
You can also search for this author in PubMed Google Scholar
Luisa Averdunk
View author publications
You can also search for this author in PubMed Google Scholar
Felix Boschann
View author publications
You can also search for this author in PubMed Google Scholar
Henrike Lisa Sczakiel
View author publications
You can also search for this author in PubMed Google Scholar
Sarina Schwartzmann
View author publications
You can also search for this author in PubMed Google Scholar
Martin Atta Mensah
View author publications
You can also search for this author in PubMed Google Scholar
Jean Tori Pantel
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Holtgrewe
View author publications
You can also search for this author in PubMed Google Scholar
Annemarie Bösch
View author publications
You can also search for this author in PubMed Google Scholar
Claudia Weiß
View author publications
You can also search for this author in PubMed Google Scholar
Natalie Weinhold
View author publications
You can also search for this author in PubMed Google Scholar
Aude-Annick Suter
View author publications
You can also search for this author in PubMed Google Scholar
Corinna Stoltenburg
View author publications
You can also search for this author in PubMed Google Scholar
Julia Neugebauer
View author publications
You can also search for this author in PubMed Google Scholar
Tillmann Kallinich
View author publications
You can also search for this author in PubMed Google Scholar
Angela M. Kaindl
View author publications
You can also search for this author in PubMed Google Scholar
Susanne Holzhauer
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Bührer
View author publications
You can also search for this author in PubMed Google Scholar
Philip Bufler
View author publications
You can also search for this author in PubMed Google Scholar
Uwe Kornak
View author publications
You can also search for this author in PubMed Google Scholar
Claus-Eric Ott
View author publications
You can also search for this author in PubMed Google Scholar
Markus Schülke
View author publications
You can also search for this author in PubMed Google Scholar
Hoa Huu Phuc Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Sabine Hoffjan
View author publications
You can also search for this author in PubMed Google Scholar
Corinna Grasemann
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Rothoeft
View author publications
You can also search for this author in PubMed Google Scholar
Folke Brinkmann
View author publications
You can also search for this author in PubMed Google Scholar
Nora Matar
View author publications
You can also search for this author in PubMed Google Scholar
Sugirthan Sivalingam
View author publications
You can also search for this author in PubMed Google Scholar
Claudia Perne
View author publications
You can also search for this author in PubMed Google Scholar
Elisabeth Mangold
View author publications
You can also search for this author in PubMed Google Scholar
Martina Kreiss
View author publications
You can also search for this author in PubMed Google Scholar
Kirsten Cremer
View author publications
You can also search for this author in PubMed Google Scholar
Regina C. Betz
View author publications
You can also search for this author in PubMed Google Scholar
Martin Mücke
View author publications
You can also search for this author in PubMed Google Scholar
Lorenz Grigull
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Klockgether
View author publications
You can also search for this author in PubMed Google Scholar
Isabel Spier
View author publications
You can also search for this author in PubMed Google Scholar
André Heimbach
View author publications
You can also search for this author in PubMed Google Scholar
Tim Bender
View author publications
You can also search for this author in PubMed Google Scholar
Fabian Brand
View author publications
You can also search for this author in PubMed Google Scholar
Christiane Stieber
View author publications
You can also search for this author in PubMed Google Scholar
Alexandra Marzena Morawiec
View author publications
You can also search for this author in PubMed Google Scholar
Pantelis Karakostas
View author publications
You can also search for this author in PubMed Google Scholar
Valentin S. Schäfer
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Bernsen
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Weydt
View author publications
You can also search for this author in PubMed Google Scholar
Sergio Castro-Gomez
View author publications
You can also search for this author in PubMed Google Scholar
Ahmad Aziz
View author publications
You can also search for this author in PubMed Google Scholar
Marcus Grobe-Einsler
View author publications
You can also search for this author in PubMed Google Scholar
Okka Kimmich
View author publications
You can also search for this author in PubMed Google Scholar
Xenia Kobeleva
View author publications
You can also search for this author in PubMed Google Scholar
Demet Önder
View author publications
You can also search for this author in PubMed Google Scholar
Hellen Lesmann
View author publications
You can also search for this author in PubMed Google Scholar
Sheetal Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Pawel Tacik
View author publications
You can also search for this author in PubMed Google Scholar
Meghna Ahuja Basin
View author publications
You can also search for this author in PubMed Google Scholar
Pietro Incardona
View author publications
You can also search for this author in PubMed Google Scholar
Min Ae Lee-Kirsch
View author publications
You can also search for this author in PubMed Google Scholar
Reinhard Berner
View author publications
You can also search for this author in PubMed Google Scholar
Catharina Schuetz
View author publications
You can also search for this author in PubMed Google Scholar
Julia Körholz
View author publications
You can also search for this author in PubMed Google Scholar
Tanita Kretschmer
View author publications
You can also search for this author in PubMed Google Scholar
Nataliya Di Donato
View author publications
You can also search for this author in PubMed Google Scholar
Evelin Schröck
View author publications
You can also search for this author in PubMed Google Scholar
André Heinen
View author publications
You can also search for this author in PubMed Google Scholar
Ulrike Reuner
View author publications
You can also search for this author in PubMed Google Scholar
Amalia-Mihaela Hanßke
View author publications
You can also search for this author in PubMed Google Scholar
Frank J. Kaiser
View author publications
You can also search for this author in PubMed Google Scholar
Eva Manka
View author publications
You can also search for this author in PubMed Google Scholar
Martin Munteanu
View author publications
You can also search for this author in PubMed Google Scholar
Alma Kuechler
View author publications
You can also search for this author in PubMed Google Scholar
Kiewert Cordula
View author publications
You can also search for this author in PubMed Google Scholar
Raphael Hirtz
View author publications
You can also search for this author in PubMed Google Scholar
Elena Schlapakow
View author publications
You can also search for this author in PubMed Google Scholar
Christian Schlein
View author publications
You can also search for this author in PubMed Google Scholar
Jasmin Lisfeld
View author publications
You can also search for this author in PubMed Google Scholar
Christian Kubisch
View author publications
You can also search for this author in PubMed Google Scholar
Theresia Herget
View author publications
You can also search for this author in PubMed Google Scholar
Maja Hempel
View author publications
You can also search for this author in PubMed Google Scholar
Christina Weiler-Normann
View author publications
You can also search for this author in PubMed Google Scholar
Kurt Ullrich
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Schramm
View author publications
You can also search for this author in PubMed Google Scholar
Cornelia Rudolph
View author publications
You can also search for this author in PubMed Google Scholar
Franziska Rillig
View author publications
You can also search for this author in PubMed Google Scholar
Maximilian Groffmann
View author publications
You can also search for this author in PubMed Google Scholar
Ania Muntau
View author publications
You can also search for this author in PubMed Google Scholar
Alexandra Tibelius
View author publications
You can also search for this author in PubMed Google Scholar
Eva M. C. Schwaibold
View author publications
You can also search for this author in PubMed Google Scholar
Christian P. Schaaf
View author publications
You can also search for this author in PubMed Google Scholar
Michal Zawada
View author publications
You can also search for this author in PubMed Google Scholar
Lilian Kaufmann
View author publications
You can also search for this author in PubMed Google Scholar
Katrin Hinderhofer
View author publications
You can also search for this author in PubMed Google Scholar
Pamela M. Okun
View author publications
You can also search for this author in PubMed Google Scholar
Urania Kotzaeridou
View author publications
You can also search for this author in PubMed Google Scholar
Georg F. Hoffmann
View author publications
You can also search for this author in PubMed Google Scholar
Daniela Choukair
View author publications
You can also search for this author in PubMed Google Scholar
Markus Bettendorf
View author publications
You can also search for this author in PubMed Google Scholar
Malte Spielmann
View author publications
You can also search for this author in PubMed Google Scholar
Annekatrin Ripke
View author publications
You can also search for this author in PubMed Google Scholar
Martje Pauly
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Münchau
View author publications
You can also search for this author in PubMed Google Scholar
Katja Lohmann
View author publications
You can also search for this author in PubMed Google Scholar
Irina Hüning
View author publications
You can also search for this author in PubMed Google Scholar
Britta Hanker
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Bäumer
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca Herzog
View author publications
You can also search for this author in PubMed Google Scholar
Yorck Hellenbroich
View author publications
You can also search for this author in PubMed Google Scholar
Dominik S. Westphal
View author publications
You can also search for this author in PubMed Google Scholar
Tim Strom
View author publications
You can also search for this author in PubMed Google Scholar
Reka Kovacs
View author publications
You can also search for this author in PubMed Google Scholar
Korbinian M. Riedhammer
View author publications
You can also search for this author in PubMed Google Scholar
Katharina Mayerhanser
View author publications
You can also search for this author in PubMed Google Scholar
Elisabeth Graf
View author publications
You can also search for this author in PubMed Google Scholar
Melanie Brugger
View author publications
You can also search for this author in PubMed Google Scholar
Julia Hoefele
View author publications
You can also search for this author in PubMed Google Scholar
Konrad Oexle
View author publications
You can also search for this author in PubMed Google Scholar
Nazanin Mirza-Schreiber
View author publications
You can also search for this author in PubMed Google Scholar
Riccardo Berutti
View author publications
You can also search for this author in PubMed Google Scholar
Ulrich Schatz
View author publications
You can also search for this author in PubMed Google Scholar
Martin Krenn
View author publications
You can also search for this author in PubMed Google Scholar
Christine Makowski
View author publications
You can also search for this author in PubMed Google Scholar
Heike Weigand
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Schröder
View author publications
You can also search for this author in PubMed Google Scholar
Meino Rohlfs
View author publications
You can also search for this author in PubMed Google Scholar
Katharina Vill
View author publications
You can also search for this author in PubMed Google Scholar
Fabian Hauck
View author publications
You can also search for this author in PubMed Google Scholar
Ingo Borggraefe
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Müller-Felber
View author publications
You can also search for this author in PubMed Google Scholar
Ingo Kurth
View author publications
You can also search for this author in PubMed Google Scholar
Miriam Elbracht
View author publications
You can also search for this author in PubMed Google Scholar
Cordula Knopp
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Begemann
View author publications
You can also search for this author in PubMed Google Scholar
Florian Kraft
View author publications
You can also search for this author in PubMed Google Scholar
Johannes R. Lemke
View author publications
You can also search for this author in PubMed Google Scholar
Julia Hentschel
View author publications
You can also search for this author in PubMed Google Scholar
Konrad Platzer
View author publications
You can also search for this author in PubMed Google Scholar
Vincent Strehlow
View author publications
You can also search for this author in PubMed Google Scholar
Rami Abou Jamra
View author publications
You can also search for this author in PubMed Google Scholar
Martin Kehrer
View author publications
You can also search for this author in PubMed Google Scholar
German Demidov
View author publications
You can also search for this author in PubMed Google Scholar
Stefanie Beck-Wödl
View author publications
You can also search for this author in PubMed Google Scholar
Holm Graessner
View author publications
You can also search for this author in PubMed Google Scholar
Marc Sturm
View author publications
You can also search for this author in PubMed Google Scholar
Lena Zeltner
View author publications
You can also search for this author in PubMed Google Scholar
Ludger J. Schöls
View author publications
You can also search for this author in PubMed Google Scholar
Janine Magg
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Bevot
View author publications
You can also search for this author in PubMed Google Scholar
Christiane Kehrer
View author publications
You can also search for this author in PubMed Google Scholar
Nadja Kaiser
View author publications
You can also search for this author in PubMed Google Scholar
Ernest Turro
View author publications
You can also search for this author in PubMed Google Scholar
Denise Horn
View author publications
You can also search for this author in PubMed Google Scholar
Annette Grüters-Kieslich
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Klein
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Mundlos
View author publications
You can also search for this author in PubMed Google Scholar
Markus Nöthen
View author publications
You can also search for this author in PubMed Google Scholar
Olaf Riess
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Meitinger
View author publications
You can also search for this author in PubMed Google Scholar
Heiko Krude
View author publications
You can also search for this author in PubMed Google Scholar
Peter M. Krawitz
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Haack
View author publications
You can also search for this author in PubMed Google Scholar
Nadja Ehmke
View author publications
You can also search for this author in PubMed Google Scholar
Matias Wagner
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Study conceptualization and design: N.E., T. Haack, P.M.K. and M.W. Sample and data acquisition: R.A.J., L.A., A.A., S.B.-W., M. Begemann, T. Bender, R. Berner, S.B., R. Berutti, M. Bettendorf, R.C.B., A. Bevot, I.B., F. Boschann, F. Brand, F. Brinkmann, M. Brugger, T. Brunet, P.B., T. Bäumer, A. Bösch, C.B., S.C.-G., D.C., K.C., M.D., G.D., N.D.D., N.E., M.E., H.E., H.G., E.G., C.G., L.G., M.G.-E., M.G., K.G., A.G.-K., T. Haack, B.H., A.-M.H., F.H., A. Heimbach, A. Heinen, Y.H., M.H., J. Hentschel, T. Herget, R. Herzog, K.H., R. Hirtz, J. Hoefele, S. Hoffjan, G.F.H., S. Holzhauer, D.H., I.H., A.M.K., F.J.K., N.K., T. Kallinich, P.K., V.K., L.T.K., C. Kehrer, M. Kehrer, C. Kiewart, O.K., C. Klein, T. Klockgether, A. Knaus, C. Knopp, X.K., U. Kornak, U. Kotzaeridou, R.K., F.K., P.M.K., M. Kreiss, M. Krenn, T. Kretschmer, H.K., C. Kubisch, A. Kuechler, S.K., I.K., J.K., M.A.L.-K., J.R.L., H.L., J.L., K.L., J.M., C.M., E. Mangold, E. Manka, N.M., K.M., T.M., M.A.M., N.M.-S., A.M.M., S. Mundlos, A.C.M., M. Munteanu, M. Mücke, W.M.-F., A.M., J.N., H.H.P.N., M.N., K.O., P.M.O., C.-E.O., J.T.P., M.P., C.P., S.P., K.P., U.R., K.M.R., O.R., F.R., A.R., M.R., T.R., C.R., C.P.S., U.A.S., E. Schlapakow, C. Schlein, A.S., C. Schramm, E. Schröck, S. Schröder, M. Schuelke, C. Schuetz, E.M.C.S., S. Schwartzmann, V.S.S., L.J.S., H.L.S., S. Sivalingam, M. Spielmann, I.S., C. Stieber, C. Stoltenburg, V.S., T.S., M. Sturm, A.-A.S., P.T., A.T., E.T., K.U., M.W., H.W., C.W.-N., N.W., C.W., D.S.W., P.W., M.Z., L.Z. and D.Ö. Analysis and interpretation: M.A.B., L.A., T. Brunet, M.D., G.D., N.E., H.E., K.G., T. Haack, M.H., T.-C.H., P.I., H. Klinkhammer, A. Knaus, P.M.K., S. Moosa, A.S., S. Sivalingam, E.T. and M.W. Manuscript writing: T. Brunet, M.D., N.E., H.E., K.G., T. Haack, T.-C.H., H. Klinkhammer, P.M.K., A.S. and M.W. Coordination and funding acquisition: N.E., T. Haack, P.M.K., H. Krude, T.M., S. Mundlos, M.N., O.R. and M.W.

Corresponding author

Correspondence to Peter M. Krawitz.

Ethics declarations

Competing interests

V.S.S. has received consultant fees from Novartis, Chugai, AbbVie, Celgene, Sanofi, Lilly, Hexal, Pfizer, Amgen, BMS, Roche, Gilead, Medac, Boehringer-Ingelheim and Alexion and speaker’s bureau fees from AbbVie, Novartis, BMS, Chugai, Celgene, Medac, Sanofi, Lilly, Hexal, Pfizer, Janssen, Roche, Schire, Onkowissen, Royal College London, Boehringer-Ingelheim and UCB Fresenius. M.G.-E. has received research support from the German Ministry of Education and Research (BMBF) within the European Joint Program for Rare Diseases (EJP-RD) 2021 Transnational Call for Rare Disease Research Projects (funding number 01GM2110), from the National Ataxia Foundation (NAF) and from Ataxia UK and received consulting fees from Healthcare Manufaktur, Germany, all unrelated to this study. All other authors declare no competing interests.

Peer review

Peer review information

Nature Genetics thanks Zornitza Stark, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary note and Figs. 1–14 and legends for Supplementary Tables 1–7.

Reporting Summary

Peer Review File

Supplementary Tables

Supplementary Tables 1–7.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Schmidt, A., Danyel, M., Grundmann, K. et al. Next-generation phenotyping integrated in a national framework for patients with ultrarare disorders improves genetic diagnostics and yields new molecular findings. Nat Genet 56, 1644–1653 (2024). https://doi.org/10.1038/s41588-024-01836-1

Download citation

Received: 19 July 2023
Accepted: 18 June 2024
Published: 22 July 2024
Issue Date: August 2024
DOI: https://doi.org/10.1038/s41588-024-01836-1