Concepedia

Publication | Closed Access

EightyDVec: a method for protein sequence similarity analysis using physicochemical properties of amino acids

Citations: 10 | References: 49 | Year: 2021

Abstract

Similarity analysis of protein sequences can reveal the evolutionary relationships among them. Effective computational algorithms are needed to compare the very large number of sequences. Alignment-based approaches to this problem are often computationally expensive, especially when the number of sequences is large. This research aims to develop an efficient alignment-free tool for protein sequence comparison and phylogenetic study. The proposed method, EightyDVec, generates features from the physicochemical properties of amino acids that best describe the evolutionary relationships among the species in a protein family. EightyDVec transforms each protein sequence into an 80-dimensional feature vector, so that sequences can be compared directly through their vectors. Four different datasets are used to validate the accuracy of EightyDVec, and the results show that the method is effective for similarity analysis of protein sequences.
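
The abstract does not say how the 80 dimensions are built, so the following is only an illustrative sketch of the general alignment-free idea, not the EightyDVec construction itself. It assumes a hypothetical layout of 20 amino acids times 4 per-residue statistics (frequency, mean normalized position, position variance, and frequency weighted by Kyte-Doolittle hydropathy as one example physicochemical property). The function names `feature_vector` and `euclidean` are made up for this example.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues

# Kyte-Doolittle hydropathy index, used here as ONE example property.
# Assumption: the paper's actual property set is not given in the abstract.
HYDROPATHY = {
    "A": 1.8, "C": 2.5, "D": -3.5, "E": -3.5, "F": 2.8,
    "G": -0.4, "H": -3.2, "I": 4.5, "K": -3.9, "L": 3.8,
    "M": 1.9, "N": -3.5, "P": -1.6, "Q": -3.5, "R": -4.5,
    "S": -0.8, "T": -0.7, "V": 4.2, "W": -0.9, "Y": -1.3,
}


def feature_vector(seq):
    """Map a protein sequence to a fixed-length 80-dimensional vector.

    Hypothetical layout: 4 statistics for each of the 20 amino acids.
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        raise ValueError("sequence must not be empty")
    vec = []
    for aa in AMINO_ACIDS:
        # Positions of this residue, normalized to (0, 1] by sequence length.
        positions = [(i + 1) / n for i, r in enumerate(seq) if r == aa]
        count = len(positions)
        freq = count / n
        mean_pos = sum(positions) / count if count else 0.0
        var_pos = (
            sum((p - mean_pos) ** 2 for p in positions) / count if count else 0.0
        )
        vec.extend([freq, mean_pos, var_pos, freq * HYDROPATHY[aa]])
    return vec


def euclidean(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


if __name__ == "__main__":
    a = feature_vector("MKTAYIAKQRQISFVKSHFSRQ")
    b = feature_vector("MKTAYIAKQRQISFVKAHFSRQ")
    print(len(a), euclidean(a, b))
```

Because every sequence maps to a vector of the same length regardless of its own length, pairwise distances can be computed without alignment, and the resulting distance matrix could then be passed to a tree-building method for phylogenetic analysis.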
