Discovery of Novel Conotoxin Candidates Using Machine Learning

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Discovery of Novel Conotoxin Candidates Using Machine Learning. / Li, Qing; Watkins, Maren; Robinson, Samuel D; Safavi-Hemami, Helena; Yandell, Mark.

I: Toxins, Bind 10, Nr. 12, 503, 2018.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Li, Q, Watkins, M, Robinson, SD, Safavi-Hemami, H & Yandell, M 2018, 'Discovery of Novel Conotoxin Candidates Using Machine Learning', Toxins, bind 10, nr. 12, 503. https://doi.org/10.3390/toxins10120503

APA

Li, Q., Watkins, M., Robinson, S. D., Safavi-Hemami, H., & Yandell, M. (2018). Discovery of Novel Conotoxin Candidates Using Machine Learning. Toxins, 10(12), [503]. https://doi.org/10.3390/toxins10120503

Vancouver

Li Q, Watkins M, Robinson SD, Safavi-Hemami H, Yandell M. Discovery of Novel Conotoxin Candidates Using Machine Learning. Toxins. 2018;10(12). 503. https://doi.org/10.3390/toxins10120503

Author

Li, Qing ; Watkins, Maren ; Robinson, Samuel D ; Safavi-Hemami, Helena ; Yandell, Mark. / Discovery of Novel Conotoxin Candidates Using Machine Learning. I: Toxins. 2018 ; Bind 10, Nr. 12.

Bibtex

@article{ce79665c45584bdb8e6144f1572a37b3,
title = "Discovery of Novel Conotoxin Candidates Using Machine Learning",
abstract = "Cone snails (genus Conus) are venomous marine snails that inject prey with a lethal cocktail of conotoxins, small, secreted, and cysteine-rich peptides. Given the diversity and often high affinity for their molecular targets, consisting of ion channels, receptors or transporters, many conotoxins have become invaluable pharmacological probes, drug leads, and therapeutics. Transcriptome sequencing of Conus venom glands followed by de novo assembly and homology-based toxin identification and annotation is currently the state-of-the-art for discovery of new conotoxins. However, homology-based search techniques, by definition, can only detect novel toxins that are homologous to previously reported conotoxins. To overcome these obstacles for discovery, we have created ConusPipe, a machine learning tool that utilizes prominent chemical characters of conotoxins to predict whether a certain transcript in a Conus transcriptome, which has no otherwise detectable homologs in current reference databases, is a putative conotoxin. By using ConusPipe on RNASeq data of 10 species, we report 5148 new putative conotoxin transcripts that have no homologues in current reference databases. 896 of these were identified by at least three out of four models used. These data significantly expand current publicly available conotoxin datasets and our approach provides a new computational avenue for the discovery of novel toxin families.",
keywords = "Animals, Conotoxins/genetics, Conus Snail/genetics, Machine Learning, Sequence Analysis, RNA, Transcriptome",
author = "Qing Li and Maren Watkins and Robinson, {Samuel D} and Helena Safavi-Hemami and Mark Yandell",
year = "2018",
doi = "10.3390/toxins10120503",
language = "English",
volume = "10",
journal = "Toxins",
issn = "2072-6651",
publisher = "M D P I AG",
number = "12",

}

RIS

TY - JOUR

T1 - Discovery of Novel Conotoxin Candidates Using Machine Learning

AU - Li, Qing

AU - Watkins, Maren

AU - Robinson, Samuel D

AU - Safavi-Hemami, Helena

AU - Yandell, Mark

PY - 2018

Y1 - 2018

N2 - Cone snails (genus Conus) are venomous marine snails that inject prey with a lethal cocktail of conotoxins, small, secreted, and cysteine-rich peptides. Given the diversity and often high affinity for their molecular targets, consisting of ion channels, receptors or transporters, many conotoxins have become invaluable pharmacological probes, drug leads, and therapeutics. Transcriptome sequencing of Conus venom glands followed by de novo assembly and homology-based toxin identification and annotation is currently the state-of-the-art for discovery of new conotoxins. However, homology-based search techniques, by definition, can only detect novel toxins that are homologous to previously reported conotoxins. To overcome these obstacles for discovery, we have created ConusPipe, a machine learning tool that utilizes prominent chemical characters of conotoxins to predict whether a certain transcript in a Conus transcriptome, which has no otherwise detectable homologs in current reference databases, is a putative conotoxin. By using ConusPipe on RNASeq data of 10 species, we report 5148 new putative conotoxin transcripts that have no homologues in current reference databases. 896 of these were identified by at least three out of four models used. These data significantly expand current publicly available conotoxin datasets and our approach provides a new computational avenue for the discovery of novel toxin families.

AB - Cone snails (genus Conus) are venomous marine snails that inject prey with a lethal cocktail of conotoxins, small, secreted, and cysteine-rich peptides. Given the diversity and often high affinity for their molecular targets, consisting of ion channels, receptors or transporters, many conotoxins have become invaluable pharmacological probes, drug leads, and therapeutics. Transcriptome sequencing of Conus venom glands followed by de novo assembly and homology-based toxin identification and annotation is currently the state-of-the-art for discovery of new conotoxins. However, homology-based search techniques, by definition, can only detect novel toxins that are homologous to previously reported conotoxins. To overcome these obstacles for discovery, we have created ConusPipe, a machine learning tool that utilizes prominent chemical characters of conotoxins to predict whether a certain transcript in a Conus transcriptome, which has no otherwise detectable homologs in current reference databases, is a putative conotoxin. By using ConusPipe on RNASeq data of 10 species, we report 5148 new putative conotoxin transcripts that have no homologues in current reference databases. 896 of these were identified by at least three out of four models used. These data significantly expand current publicly available conotoxin datasets and our approach provides a new computational avenue for the discovery of novel toxin families.

KW - Animals

KW - Conotoxins/genetics

KW - Conus Snail/genetics

KW - Machine Learning

KW - Sequence Analysis, RNA

KW - Transcriptome

U2 - 10.3390/toxins10120503

DO - 10.3390/toxins10120503

M3 - Journal article

C2 - 30513724

VL - 10

JO - Toxins

JF - Toxins

SN - 2072-6651

IS - 12

M1 - 503

ER -

ID: 232823063