20 Hits in 0.81 sec

Designing Efficient Spaced Seeds for SOLiD Read Mapping

Laurent Noé, Marta Gîrdea, Gregory Kucherov
2010 Advances in Bioinformatics  
The advent of high-throughput sequencing technologies constituted a major advance in genomic studies, offering new prospects in a wide range of applications.We propose a rigorous and flexible algorithmic solution to mapping SOLiD color-space reads to a reference genome. The solution relies on an advanced method of seed design that uses a faithful probabilistic model of read matches and, on the other hand, a novel seeding principle especially adapted to read mapping. Our method can handle both
more » ... ssy and lossless frameworks and is able to distinguish, at the level of seed design, between SNPs and reading errors. We illustrate our approach by several seed designs and demonstrate their efficiency.
doi:10.1155/2010/708501 pmid:20936175 pmcid:PMC2945724 fatcat:nahdomgtm5bfpf3pk2mrmifh2i

Seed Design Framework for Mapping SOLiD Reads [chapter]

Laurent Noé, Marta Gîrdea, Gregory Kucherov
2010 Lecture Notes in Computer Science  
The advent of high-throughput sequencing technologies constituted a major advance in genomic studies, offering new prospects in a wide range of applications. We propose a rigorous and flexible algorithmic solution to mapping SOLiD color-space reads to a reference genome. The solution relies on an advanced method of seed design that uses a faithful probabilistic model of read matches and, on the other hand, a novel seeding principle especially adapted to read mapping. Our method can handle both
more » ... ossy and lossless frameworks and is able to distinguish, at the level of seed design, between SNPs and reading errors. We illustrate our approach by several seed designs and demonstrate their efficiency.
doi:10.1007/978-3-642-12683-3_25 fatcat:65fdkd7yjrgkrpcoujaeqjlw7y

Back-Translation for Discovering Distant Protein Homologies [chapter]

Marta Gîrdea, Laurent Noé, Gregory Kucherov
2009 Lecture Notes in Computer Science  
Frameshift mutations in protein-coding DNA sequences produce a drastic change in the resulting protein sequence, which prevents classic protein alignment methods from revealing the proteins' common origin. Moreover, when a large number of substitutions are additionally involved in the divergence, the homology detection becomes difficult even at the DNA level. To cope with this situation, we propose a novel method to infer distant homology relations of two proteins, that accounts for frameshift
more » ... nd point mutations that may have affected the coding sequences. We design a dynamic programming alignment algorithm over memory-efficient graph representations of the complete set of putative DNA sequences of each protein, with the goal of determining the two putative DNA sequences which have the best scoring alignment under a powerful scoring system designed to reflect the most probable evolutionary process. This allows us to uncover evolutionary information that is not captured by traditional alignment methods, which is confirmed by biologically significant examples.
doi:10.1007/978-3-642-04241-6_10 fatcat:wawa3jrjtfclhnb6wb54o522b4

Prioritizing Clinically Relevant Copy Number Variation from Genetic Interactions and Gene Function Data

Justin Foong, Marta Girdea, James Stavropoulos, Michael Brudno, Chunyu Liu
2015 PLoS ONE  
It is becoming increasingly necessary to develop computerized methods for identifying the few disease-causing variants from hundreds discovered in each individual patient. This problem is especially relevant for Copy Number Variants (CNVs), which can be cheaply interrogated via low-cost hybridization arrays commonly used in clinical practice. We present a method to predict the disease relevance of CNVs that combines functional context and clinical phenotype to discover clinically harmful CNVs
more » ... nd likely causative genes) in patients with a variety of phenotypes. We compare several feature and gene weighing systems for classifying both genes and CNVs. We combined the best performing methodologies and parameters on over 2,500 Agilent CGH 180k Microarray CNVs derived from 140 patients. Our method achieved an F-score of 91.59%, with 87.08% precision and 97.00% recall. Our methods are freely available at Our dataset is included with the supplementary information.
doi:10.1371/journal.pone.0139656 pmid:26437450 pmcid:PMC4593641 fatcat:msxw5of6r5adpalxgq5fromffi

Back-translation for discovering distant protein homologies in the presence of frameshift mutations

Marta Gîrdea, Laurent Noé, Gregory Kucherov
2010 Algorithms for Molecular Biology  
Algorithms for Molecular Biology 2010, 5:6 Gîrdea et al.  ...  Algorithms for Molecular Biology 2010, 5:6 Page 7 of 15 Gîrdea et al.  ... 
doi:10.1186/1748-7188-5-6 pmid:20047662 pmcid:PMC2821327 fatcat:xzr2xbglmncefdiy3qicl6mtj4

A Hybrid Genetic Programming and Boosting Technique for Learning Kernel Functions from Training Data

Marta Girdea, Liviu Ciortuz
2007 Ninth International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC 2007)  
This paper proposes a technique for learning kernel functions that can be used in non-linear SVM classification. The technique uses genetic programming to evolve kernel functions as additive or multiplicative combinations of linear, polynomial and RBF kernels, while a procedure inspired from InfoBoost helps the evolved kernels concentrate on the most difficult objects to classify. The kernels obtained at each boosting round participate in the training of non-linear SVMs which are combined,
more » ... with their confidence coefficients, into a final classifier. We compared on several data sets the performance of the kernels obtained in this manner with the performance of classic RBF kernels and of kernels evolved using a pure GP method, and we concluded that the boosted GP kernels are generally better.
doi:10.1109/synasc.2007.71 dblp:conf/synasc/GirdeaC07 fatcat:oneogk5vurepbngsaggus2l3by

Care and cost consequences of pediatric whole genome sequencing compared to chromosome microarray

Robin Z. Hayeems, Jasmin Bhawra, Kate Tsiplova, M. Stephen Meyn, Nasim Monfared, Sarah Bowdin, D. James Stavropoulos, Christian R. Marshall, Raveen Basran, Cheryl Shuman, Shinya Ito, Iris Cohn (+6 others)
2017 European Journal of Human Genetics  
The clinical use of whole-genome sequencing (WGS) is expected to alter pediatric medical management. The study aimed to describe the type and cost of healthcare activities following pediatric WGS compared to chromosome microarray (CMA). Healthcare activities prompted by WGS and CMA were ascertained for 101 children with developmental delay over 1 year. Activities following receipt of non-diagnostic CMA were compared to WGS diagnostic and non-diagnostic results. Activities were costed in 2016
more » ... adian dollars (CDN). Ongoing care accounted for 88.6% of post-test activities. The mean number of lab tests was greater following CMA than WGS (0.55 vs. 0.09; p = 0.007). The mean number of specialist visits was greater following WGS than CMA (0.41 vs. 0; p = 0.016). WGS results (diagnostic vs. non-diagnostic) modified the effect of test type on mean number of activities (p < 0.001). The cost of activities prompted by diagnostic WGS exceeded $557CDN for 10% of cases. In complex pediatric care, CMA prompted additional diagnostic investigations while WGS prompted tailored care guided by genotypic variants. Costs for prompted activities were low for the majority and constitute a small proportion of total test costs. Optimal use of WGS depends on robust evaluation of downstream care and cost consequences.
doi:10.1038/s41431-017-0020-3 pmid:29158552 fatcat:tauljtcgabckxbf4wajx6icloy

PhenomeCentral: A Portal for Phenotypic and Genotypic Matchmaking of Patients with Rare Genetic Diseases

Orion J. Buske, Marta Girdea, Sergiu Dumitriu, Bailey Gallinger, Taila Hartley, Heather Trang, Andriy Misyura, Tal Friedman, Chandree Beaulieu, William P. Bone, Amanda E. Links, Nicole L. Washington (+7 others)
2015 Human Mutation  
The discovery of disease-causing mutations typically requires confirmation of the variant or gene in multiple unrelated individuals, and a large number of rare genetic diseases remain unsolved due to difficulty identifying second families. To enable the secure sharing of case records by clinicians and rare disease scientists, we have developed the PhenomeCentral portal (https:// Each record includes a phenotypic description and relevant genetic information (exome or
more » ... e genes). PhenomeCentral identifies similar patients in the database based on semantic similarity between clinical features, automatically prioritized genes from whole-exome data, and candidate genes entered by the users, enabling both hypothesis-free and hypothesis-driven matchmaking. Users can then contact other submitters to follow up on
doi:10.1002/humu.22851 pmid:26251998 pmcid:PMC5467641 fatcat:wt3uvlhaxze55h54byafspg3d4

Distributed Cognition and Process Management Enabling Individualized Translational Research: The NIH Undiagnosed Diseases Program Experience

Amanda E. Links, David Draper, Elizabeth Lee, Jessica Guzman, Zaheer Valivullah, Valerie Maduro, Vlad Lebedev, Maxim Didenko, Garrick Tomlin, Michael Brudno, Marta Girdea, Sergiu Dumitriu (+12 others)
2016 Frontiers in Medicine  
The National Institutes of Health Undiagnosed Diseases Program (NIH UDP) applies translational research systematically to diagnose patients with undiagnosed diseases. The challenge is to implement an information system enabling scalable translational research. The authors hypothesized that similar complex problems are resolvable through process management and the distributed cognition of communities. The team, therefore, built the NIH UDP integrated collaboration system (UDPICS) to form virtual
more » ... collaborative multidisciplinary research networks or communities. UDPICS supports these communities through integrated process management, ontology-based phenotyping, biospecimen management, cloud-based genomic analysis, and an electronic laboratory notebook. UDPICS provided a mechanism for efficient, transparent, and scalable translational research and thereby addressed many of the complex and diverse research and logistical problems of the NIH UDP. Full definition of the strengths and deficiencies of UDPICS will require formal qualitative and quantitative usability and process improvement measurement.
doi:10.3389/fmed.2016.00039 pmid:27785453 pmcid:PMC5060938 fatcat:wfom55rpzzhohelp5l2w6yhtte

Getting Ready for the Human Phenome Project: The 2012 Forum of the Human Variome Project

William S. Oetting, Peter N. Robinson, Marc S. Greenblatt, Richard G. Cotton, Tim Beck, John C. Carey, Sandra C. Doelken, Marta Girdea, Tudor Groza, Carol M. Hamilton, Ada Hamosh, Berit Kerner (+11 others)
2013 Human Mutation  
Marta Girdea, from the University of Toronto, Ontario, spoke for the Finding of Rare Diseases Genes in Canada (FORGE Canada) on ways to ensure accurate phenotyping data is captured in her talk "Easy phenotyping  ... 
doi:10.1002/humu.22293 pmid:23401191 pmcid:PMC4130157 fatcat:olvqoha4mrcstgbq23cpufuhka

Computational evaluation of exome sequence data using human and model organism phenotypes improves diagnostic efficiency

William P. Bone, Nicole L. Washington, Orion J. Buske, David R. Adams, Joie Davis, David Draper, Elise D. Flynn, Marta Girdea, Rena Godfrey, Gretchen Golas, Catherine Groden, Julius Jacobsen (+21 others)
2015 Genetics in Medicine  
doi:10.1038/gim.2015.137 pmid:26562225 pmcid:PMC4916229 fatcat:hpshwxrqyrcojbw5ggzcqotfnm

The Matchmaker Exchange: A Platform for Rare Disease Gene Discovery

Anthony A. Philippakis, Danielle R. Azzariti, Sergi Beltran, Anthony J. Brookes, Catherine A. Brownstein, Michael Brudno, Han G. Brunner, Orion J. Buske, Knox Carey, Cassie Doll, Sergiu Dumitriu, Stephanie O.M. Dyke (+26 others)
2015 Human Mutation  
There are few better examples of the need for data sharing than in the rare disease community, where patients, physicians, and researchers must search for "the needle in a haystack" to uncover rare, novel causes of disease within the genome. Impeding the pace of discovery has been the existence of many small siloed datasets within individual research or clinical laboratory databases and/or disease-specific organizations, hoping for serendipitous occasions when two distant investigators happen
more » ... learn they have a rare phenotype in common and can "match" these cases to build evidence for causality. Research Chair in Law and Medicine; Public Population Project in Genomics and Society (P3G). C 2015 WILEY PERIODICALS, INC. However, serendipity has never proven to be a reliable or scalable approach in science. As such, the Matchmaker Exchange (MME) was launched to provide a robust and systematic approach to rare disease gene discovery through the creation of a federated network connecting databases of genotypes and rare phenotypes using a common application programming interface (API). The core building blocks of the MME have been defined and assembled. Three MME services have now been connected through the API and are available for community use. Additional databases that support internal matching are anticipated to join the MME network as it continues to grow. Hum Mutat 36:915-921, 2015. C 2015 Wiley Periodicals, Inc.
doi:10.1002/humu.22858 pmid:26295439 pmcid:PMC4610002 fatcat:f5r4wjy3rjcyngkpnfuvbqaeey

The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data

Sebastian Köhler, Sandra C. Doelken, Christopher J. Mungall, Sebastian Bauer, Helen V. Firth, Isabelle Bailleul-Forestier, Graeme C. M. Black, Danielle L. Brown, Michael Brudno, Jennifer Campbell, David R. FitzPatrick, Janan T. Eppig (+35 others)
2013 Nucleic Acids Research  
The Human Phenotype Ontology (HPO) project, available at, provides a structured, comprehensive and welldefined set of 10,088 classes (terms) describing human phenotypic abnormalities and 13,326 subclass relations between the HPO classes. In addition we have developed logical definitions for 46% of all HPO classes using terms from ontologies for anatomy, cell types, function, embryology, pathology and other domains. This allows interoperability with
more » ... resources, especially those containing phenotype information on model organisms such as mouse and zebrafish. Here we describe the updated HPO database, which provides annotations of 7,278 human hereditary syndromes listed in OMIM, Orphanet and DECIPHER to classes of the HPO. Various meta-attributes such as frequency, references and negations are associated with each annotation. Several large-scale projects worldwide utilize the HPO for describing phenotype information in their datasets. We have therefore generated equivalence mappings to other phenotype vocabularies such as LDDB, Orphanet, MedDRA, UMLS and phenoDB, allowing integration of existing datasets and interoperability with multiple biomedical resources. We have created various ways to access the HPO database content using flat files, a MySQL database, and Web-based tools. All data and documentation on the HPO project can be found online.
doi:10.1093/nar/gkt1026 pmid:24217912 pmcid:PMC3965098 fatcat:p2ubynvoprbopaeyoboloexzcq

Whole-genome sequencing expands diagnostic utility and improves clinical management in paediatric medicine

Dimitri J Stavropoulos, Daniele Merico, Rebekah Jobling, Sarah Bowdin, Nasim Monfared, Bhooma Thiruvahindrapuram, Thomas Nalpathamkalam, Giovanna Pellecchia, Ryan K C Yuen, Michael J Szego, Robin Z Hayeems, Randi Zlotnik Shaul (+51 others)
2016 npj Genomic Medicine  
The standard of care for first-tier clinical investigation of the aetiology of congenital malformations and neurodevelopmental disorders is chromosome microarray analysis (CMA) for copy-number variations (CNVs), often followed by gene(s)-specific sequencing searching for smaller insertion-deletions (indels) and single-nucleotide variant (SNV) mutations. Whole-genome sequencing (WGS) has the potential to capture all classes of genetic variation in one experiment; however, the diagnostic yield
more » ... mutation detection of WGS compared to CMA, and other tests, needs to be established. In a prospective study we utilised WGS and comprehensive medical annotation to assess 100 patients referred to a paediatric genetics service and compared the diagnostic yield versus standard genetic testing. WGS identified genetic variants meeting clinical diagnostic criteria in 34% of cases, representing a fourfold increase in diagnostic rate over CMA (8%; P value = 1.42E − 05) alone and more than twofold increase in CMA plus targeted gene sequencing (13%; P value = 0.0009). WGS identified all rare clinically significant CNVs that were detected by CMA. In 26 patients, WGS revealed indel and missense mutations presenting in a dominant (63%) or a recessive (37%) manner. We found four subjects with mutations in at least two genes associated with distinct genetic disorders, including two cases harbouring a pathogenic CNV and SNV. When considering medically actionable secondary findings in addition to primary WGS findings, 38% of patients would benefit from genetic counselling. Clinical implementation of WGS as a primary test will provide a higher diagnostic yield than conventional genetic testing and potentially reduce the time required to reach a genetic diagnosis.
doi:10.1038/npjgenmed.2015.12 pmid:28567303 pmcid:PMC5447450 fatcat:kyib3zsflrfnfcm75lxxaen3me

Study regarding the incidence of the subjective simptomatology of the tmj disorders in a sample of adolescents with dento-maxillary anomalies from constanta, treated with different types of orthodontic appliances

Dragos Totolici, Lucian Ieremia, Marta Gîrdea, Cristina Puscasu, Dragos Totolici
This study starts from certain observation of some clinical features regarding the subjective symptoms in connection with the specific pathology of the temporomandibular dysfunction (TMD). The observations were made on a sample of adolescent with orthodontic appliances, so that having different dento-maxillary anomalies. They had to fill in four questionnaires with closed and combined final in order to evaluate the subjective simptomatology of the potential dysfunction of the temporomandibular
more » ... oint (TMJ). On the other hand it was searched for a potential correlation that might be between a sort of a dento-maxillary anomaly and the incidence of the TMD simptomatol-ogy.
« Previous Showing results 1 — 15 out of 20 results