Journal Article10.1109/tcbb.2022.3191325
Multi-View Kernel Sparse Representation for Identification of Membrane Protein Types
24
TL;DR: In this article , the protein sequence is described via three different views (features), including amino acid composition, evolutionary information and physicochemical properties of amino acids, and a coupling strategy for Kernel Sparse Representation based Classification (KSRC) and construct a new model called Multi-view KSRC (MvKSRC).
read more
Abstract: Membrane proteins are the main undertaker of biomembrane functions and play a vital role in many biological activities of organisms. Prediction of membrane protein types has a great help in determining the function of proteins and understanding the interactions of membrane proteins. However, the biochemical experiment is expensive and not suitable for the large-scale identification of membrane protein types. Therefore, computational methods were used to improve the efficiency of biological experiments. Most existing computational methods only use a single feature of protein, or use multiple features but do not integrate these well. In our study, the protein sequence is described via three different views (features), including amino acid composition, evolutionary information and physicochemical properties of amino acids. To exploit information among all views (features), we introduce a coupling strategy for Kernel Sparse Representation based Classification (KSRC) and construct a new model called Multi-view KSRC (MvKSRC). We implement our method on 4 benchmark data sets of membrane proteins. The comparison results indicate that our method is much superior to all existing methods.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Molecular Joint Representation Learning via Multi-modal Information of SMILES and Graphs
TL;DR: In this paper , the authors proposed a novel framework of molecular joint representation learning via multi-modal information of simplified molecular input line entry system (SMILES) and molecular graph, called MMSG.
14
MVML-MPI: Multi-View Multi-Label Learning for Metabolic Pathway Inference.
Xiaoyi Liu,Hongpeng Yang,Chengwei Ai,Yijie Ding,Fei Guo,Jijun Tang +5 more
TL;DR: MVML-MPI accurately represents and effectively captures the complex relationship between compounds and metabolic pathways and distinguishes itself from current machine learning-based methods.
8
Prediction of blood–brain barrier penetrating peptides based on data augmentation with Augur
Zhi-Feng Gu,Yu-Duo Hao,Tian-Yu Wang,Peiling Cai,Yang Zhang,Ke-Jun Deng,Hao Lin,Hao Lv +7 more
TL;DR: This newly developed Augur model demonstrates superior performance in predicting blood–brain barrier penetrating peptides, offering valuable insights for drug development targeting neurological disorders and paving the way for innovative treatment strategies for central nervous system diseases.
7
Multi-view local hyperplane nearest neighbor model based on independence criterion for identifying vesicular transport proteins.
Yijie Ding,Quan Zou +1 more
TL;DR: A novel multi-view classifier called graph-regularized k-local hyperplane distance nearest neighbor model (HSIC-GHKNN), which combines the Hilbert-Schmidt independence criterion (HSic)-based multi- view learning method with a local hyperplanedistance nearest-neighbor classifier, which outperformed existing methods in most evaluation metrics.
4
Identification of DNase I hypersensitive sites in the human genome by multiple sequence descriptors
Yan-Ting Jin,Yang Tan,Zhong‐Ru Gan,Yanna Hao,Tianyu Wang,Hao Lin,Bo Tang +6 more
TL;DR: This model can assist scholars conducting DNase research in identifying DHSs and was proposed to perform the final model construction by comparing the prediction performance of various classification algorithms through five-fold cross-validation.
2
References
Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences
Weizhong Li,Adam Godzik +1 more
TL;DR: Cd-hit-2d compares two protein datasets and reports similar matches between them; cd- Hit-est clusters a DNA/RNA sequence database and cd- hit-est-2D compares two nucleotide datasets.
10.7K
Robust Face Recognition via Sparse Representation
TL;DR: This work considers the problem of automatically recognizing human faces from frontal views with varying expression and illumination, as well as occlusion and disguise, and proposes a general classification algorithm for (image-based) object recognition based on a sparse representation computed by C1-minimization.
$rm K$ -SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation
TL;DR: A novel algorithm for adapting dictionaries in order to achieve sparse signal representations, the K-SVD algorithm, an iterative method that alternates between sparse coding of the examples based on the current dictionary and a process of updating the dictionary atoms to better fit the data.
10K
Emergence of simple-cell receptive field properties by learning a sparse code for natural images
TL;DR: It is shown that a learning algorithm that attempts to find sparse linear codes for natural scenes will develop a complete family of localized, oriented, bandpass receptive fields, similar to those found in the primary visual cortex.
•Book
Least Squares Support Vector Machines
Johan A. K. Suykens,Tony Van Gestel,Jos De Brabanter,Bart De Moor,Joos Vandewalle +4 more
- 12 Nov 2002
TL;DR: Support Vector Machines Basic Methods of Least Squares Support Vector Machines Bayesian Inference for LS-SVM Models Robustness Large Scale Problems LS- sVM for Unsupervised Learning LS- SVM for Recurrent Networks and Control.
Related Papers (5)
Kristin P. Bennett
- 22 May 2005
Jeong-Woo Son,Seong-Bae Park,Ku-Jin Kim +2 more
- 01 Aug 2007
Shankar Vembu,Sandra Zilles +1 more
Khanh Nguyen
- 01 Aug 2017