InParanoid 7 : new algorithms and tools for eukaryotic orthology analysis
Gabriel Östlund,Thomas Schmitt,Kristoffer Forslund,Tina Köstler,David N. Messina,Sanjit Roopra,Oliver Frings,Erik L. L. Sonnhammer +7 more
TL;DR: A two-pass BLAST approach was developed that makes use of high-precision compositional score matrix adjustment, but avoids the alignment truncation that sometimes follows in homology assignment.
read more
Abstract: The InParanoid project gathers proteomes of completely sequenced eukaryotic species plus Escherichia coli and calculates pairwise ortholog relationships among them. The new release 7.0 of the database has grown by an order of magnitude over the previous version and now includes 100 species and their collective 1.3 million proteins organized into 42.7 million pairwise ortholog groups. The InParanoid algorithm itself has been revised and is now both more specific and sensitive. Based on results from our recent benchmarking of low-complexity filters in homology assignment, a two-pass BLAST approach was developed that makes use of high-precision compositional score matrix adjustment, but avoids the alignment truncation that sometimes follows. We have also updated the InParanoid web site (http://InParanoid.sbc.su.se). Several features have been added, the response times have been improved and the site now sports a new, clearer look. As the number of ortholog databases has grown, it has become difficult to compare among these resources due to a lack of standardized source data and incompatible representations of ortholog relationships. To facilitate data exchange and comparisons among ortholog databases, we have developed and are making available two XML schemas: SeqXML for the input sequences and OrthoXML for the output ortholog clusters.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Analysis of 41 plant genomes supports a wave of successful genome duplications in association with the Cretaceous-Paleogene boundary
TL;DR: It is argued that considering the evolutionary potential of polyploids in light of the environmental and ecological conditions present around the time ofpolyploidization could mitigate the stark contrast in the proposed evolutionary fates of Polyploids.
438
OrthoDB: a hierarchical catalog of animal, fungal and bacterial orthologs
TL;DR: The update of OrthoDB—the hierarchical catalog of orthologs is presented, which provides computed evolutionary traits of orthology, such as gene duplicability and loss profiles, divergence rates, sibling groups, and now extended with exon–intron architectures, syntenic Orthologs and parent–child trees.
430
The Notch and Wnt pathways regulate stemness and differentiation in human fallopian tube organoids.
Mirjana Kessler,Karen Hoffmann,Volker Brinkmann,Oliver Thieck,Susan Jackisch,Benjamin Toelle,Hilmar Berger,Hans-Joachim Mollenkopf,Mandy Mangler,Jalid Sehouli,Christina Fotopoulou,Thomas F. Meyer +11 more
TL;DR: It is shown that single epithelial stem cells in vitro can give rise to differentiated organoids containing ciliated and secretory cells, and that organoids also respond to oestradiol and progesterone treatment in a physiological manner.
The Ras protein superfamily: Evolutionary tree and role of conserved amino acids
TL;DR: Phylogenetic analysis of gene families at the organism and sequence level revealed complex relationships between the evolution of this protein superfamily sequence and the acquisition of distinct cellular functions.
The water lily genome and the early evolution of flowering plants
Liangsheng Zhang,Fei Chen,Fei Chen,Xingtan Zhang,Zhen Li,Yiyong Zhao,Yiyong Zhao,Rolf Lohaus,Xiaojun Chang,Xiaojun Chang,Wei Dong,Simon Y. W. Ho,Xing Liu,Aixia Song,Junhao Chen,Wenlei Guo,Zhengjia Wang,Yingyu Zhuang,Haifeng Wang,Xuequn Chen,Juan Hu,Yanhui Liu,Yuan Qin,Kai Wang,Shan-Shan Dong,Yang Liu,Shouzhou Zhang,Xianxian Yu,Qian Wu,Liangsheng Wang,Xueqing Yan,Yuannian Jiao,Hongzhi Kong,Xiaofan Zhou,Yu Cuiwei,Chen Yuchu,Fan Li,Jihua Wang,Wei Chen,Xinlu Chen,Qidong Jia,Chi Zhang,Yifan Jiang,Wanbo Zhang,Guanhua Liu,Jianyu Fu,Feng Chen,Feng Chen,Hong Ma,Yves Van de Peer,Yves Van de Peer,Haibao Tang +51 more
TL;DR: The genome of the tropical blue-petal water lily Nymphaea colorata and the transcriptomes from 19 other Nymphaeales species provide insights into the early evolution of angiosperms.
References
NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins
TL;DR: The National Center for Biotechnology Information Reference Sequence (RefSeq) database provides a non-redundant collection of sequences representing genomic data, transcripts and proteins that pragmatically includes sequence data that are currently publicly available in the archival databases.
4.8K
Distinguishing Homologous From Analogous Proteins
TL;DR: This work provides a means by which it is possible to determine whether two groups of related proteins have a common ancestor or are of independent origin, and how many nucleotide positions must differ in the genes encoding the two presumptively homologous proteins.
1.6K
The TIGR Rice Genome Annotation Resource: Improvements and New Features
Shu Ouyang,Wei Zhu,John A. Hamilton,Haining Lin,Matthew Campbell,Kevin L. Childs,Françoise Thibaud-Nissen,Renae L. Malek,Yuandan Lee,Li Zheng,Joshua Orvis,Brian J. Haas,Jennifer R. Wortman,C. Robin Buell +13 more
TL;DR: Through incorporation of multiple transcript and proteomic expression data sets, the Institute for Genomic Research has been able to annotate 24 799 genes (31 739 gene models), representing ∼50% of the total gene models, as expressed in the rice genome.
1.3K
PlasmoDB: a functional genomic database for malaria parasites
Cristina Aurrecoechea,John Brestelli,Brian P. Brunk,Jennifer Dommer,Steve Fischer,Bindu Gajria,Xin Gao,Alan R. Gingle,Gregory R. Grant,Omar S. Harb,Mark Heiges,Frank Innamorato,John Iodice,Jessica C. Kissinger,Eileen Kraemer,Wei Li,John A. Miller,Vishal Nayak,Cary Pennington,Deborah F. Pinney,David S. Roos,Chris Ross,Christian J. Stoeckert,Charles Treatman,Haiming Wang +24 more
TL;DR: PlasmoDB as mentioned in this paper is a functional genomic database for Plasmodium spp. that provides a resource for data analysis and visualization in a gene-by-gene or genome-wide scale.
1.1K
Protein database searches using compositionally adjusted substitution matrices
Stephen F. Altschul,John C. Wootton,E. Michael Gertz,Richa Agarwala,Aleksandr Morgulis,Alejandro A. Schäffer,Yi-Kuo Yu +6 more
TL;DR: This work has recently developed a general procedure for transforming a standard matrix into one appropriate for the comparison of two sequences with arbitrary, and possibly differing compositions.
1.1K
Related Papers (5)
M Ashburner,Catherine A. Ball,Judith A. Blake,David Botstein,Heather Butler,J. M. Cherry,Allan Peter Davis,Kara Dolinski,Selina S. Dwight,J.T. Eppig,Midori A. Harris,David P. Hill,Laurie Issel-Tarver,Andrew Kasarskis,Suzanna E. Lewis,John C. Matese,Joel E. Richardson,M. Ringwald,Gerald M. Rubin,Gavin Sherlock +19 more