Publication | Open Access
Identification of amino acids with sensitive nanoporous MoS2: towards machine learning-based prediction
77
Citations
32
References
2018
Year
EngineeringAmino AcidsChemical AnalysisMolecular BiologyComputational ChemistryChemistryMolecular DesignBiosensing SystemsMachine Learning TechniquesMolecular SimulationMolecular RecognitionComputational BiochemistryBiophysicsMolecular SciencesSensitive Nanoporous Mos2Protein ModelingAbstract Protein DetectionBioinformaticsMolecular ModelingBiomolecular ScienceProtein BioinformaticsBiomolecular EngineeringNatural SciencesMolecular PropertyComputational BiologyMolecular BiophysicsNanopore TechnologyNanopores
Abstract Protein detection plays a key role in determining the single point mutations which can cause a variety of diseases. Nanopore sequencing provides a label-free, single base, fast and long reading platform, which makes it amenable for personalized medicine. A challenge facing nanopore technology is the noise in ionic current. Here, we show that a nanoporous single-layer molybdenum disulfide (MoS 2 ) can detect individual amino acids in a polypeptide chain (16 units) with a high accuracy and distinguishability. Using extensive molecular dynamics simulations (with a total aggregate simulation time of 66 µs) and machine learning techniques, we featurize and cluster the ionic current and residence time of the 20 amino acids and identify the fingerprints of the signals. Using logistic regression, nearest neighbor, and random forest classifiers, the sensor reading is predicted with an accuracy of 72.45, 94.55, and 99.6%, respectively. In addition, using advanced ML classification techniques, we are able to theoretically predict over 2.8 million hypothetical sensor readings’ amino acid types.
| Year | Citations | |
|---|---|---|
Page 1
Page 1