Genome mining

Genome mining is associated with bioinformatics investigations.

Genome mining describes the exploitation of genomic information for the discovery of biosynthetic pathways of natural products and their possible interactions.[1] It depends on computational technology and bioinformatics tools. The mining process relies on a huge amount of data (represented by DNA sequences and annotations) accessible in genomic databases. By applying data mining algorithms, the data can be used to generate new knowledge in several areas of medicinal chemistry,[2][3] such as discovering novel natural products.[4]

History

In the mid- to late 1980s, researchers have increasingly focused on genetic studies with the advancing sequencing technologies.[5] The GenBank database was established in 1982 for the collection, management, storage, and distribution of DNA sequence data due to the increasing availability of DNA sequences. With the increasing number of genetic data, biotechnological companies have been able to use human DNA sequence to develop protein and antibody drugs through genome mining since 1992.[6] In the late 1990s, many companies, such as Amgen, Immunec, Genentech were able to develop drugs that progressed to the clinical stage by adopting genome mining.[7] Since the Human Genome Project was completed in the early 2000, researchers have been sequencing the genomes of many microorganisms.[8] Subsequently, many of these genomes have been carefully studied to identify new genes and biosynthetic pathways.[9]

Algorithms

As large quantities of genomic sequence data began to accumulate in public databases, genetic algorithms became important to decipher the enormous collection of genomic data. They are commonly used to generate high-quality solutions to optimization and search problems by relying on bio-inspired operators such as mutation, crossover and selection.[10] The followings are commonly used genetic algorithms:

  • AntiSMASH (Antibiotics and Secondary Metabolite Analysis Shell)[11] addresses secondary metabolite genome pipelines.[12]
  • PRISM (Prediction Informatics for Secondary Metabolites)[13] is a combinatorial approach to chemical structure prediction for genetically encoded nonribosomal peptides and type I and II polyketides.[14]
  • SIM (Statistically based sequence similarity) method, such as FASTA or PSI-BLAST[broken anchor], infer orthologous homology.[15]
  • BLAST (Basic local alignment search tool) is an approach for rapid sequence comparison.[16]

Applications

Genome mining applies on the discovery of natural product by facilitating the characterization of novel molecules and biosynthetic pathways.[4][17]

Natural product discovery

The production of natural products is regulated by the biosynthetic gene clusters (BGCs) encoded in the microorganism.[18] By adopting genome mining, the BGCs that produce the target natural product can be predicted.[19] Some important enzymes responsible for the formation of natural products are polyketide synthases (PKS), non-ribosomal peptide synthases (NRPS), ribosomally and post-translationally modified peptides (RiPPs), and terpenoids, and many more.[20] Mining for enzymes, researchers can figure out the classes that BGCs encode and compare target gene clusters to known gene clusters.[21] To verify the relation between the BGCs and natural products, the target BGCs can be expressed by suitable host through the use of molecular cloning.[22]

Databases and tools

Genetic data has been accumulated in databases. Researchers are able to utilize algorithms to decipher the data accessible from databases for the discovery of new processes, targets, and products.[10] The following are databases and tools:

  • GenBank database provides genomic datasets for analysis.[23]
  • UCSC Genome Browser
  • AntiSMASH-DB[11][24] allows comparing the sequences of newly sequenced BGCs against those of previously predicted and experimentally characterized ones.[25]
  • BIG-FAM [26] is a biosynthetic gene cluster family database.[27]
  • DoBISCUIT[28] is a database of secondary metabolite biosynthetic gene clusters.[29]
  • MIBiG (Minimum Information about a Biosynthetic Gene cluster specification)[30] provides a standard for annotations and metadata on biosynthetic gene clusters and their molecular products.[31]
  • Interactive tree of life (iTOL)[32] is a web-based tool for the display, manipulation and annotation of phylogenetic trees.[33]

References

  1. ^ Albarano L, Esposito R, Ruocco N, Costantini M (April 2020). "Genome Mining as New Challenge in Natural Products Discovery". Marine Drugs. 18 (4): 199. doi:10.3390/md18040199. PMC 7230286. PMID 32283638.
  2. ^ Hannigan GD, Prihoda D, Palicka A, Soukup J, Klempir O, Rampula L, et al. (October 2019). "A deep learning genome-mining strategy for biosynthetic gene cluster prediction". Nucleic Acids Research. 47 (18): e110. doi:10.1093/nar/gkz654. PMC 6765103. PMID 31400112.
  3. ^ Lee N, Hwang S, Kim J, Cho S, Palsson B, Cho BK (2020-01-01). "Mini review: Genome mining approaches for the identification of secondary metabolite biosynthetic gene clusters in Streptomyces". Computational and Structural Biotechnology Journal. 18: 1548–1556. doi:10.1016/j.csbj.2020.06.024. PMC 7327026. PMID 32637051.
  4. ^ a b Challis GL (May 2008). "Genome mining for novel natural product discovery". Journal of Medicinal Chemistry. 51 (9): 2618–2628. doi:10.1021/jm700948z. PMID 18393407.
  5. ^ Bains W, Smith GC (December 1988). "A novel method for nucleic acid sequence determination". Journal of Theoretical Biology. 135 (3): 303–307. Bibcode:1988JThBi.135..303B. doi:10.1016/S0022-5193(88)80246-7. PMID 3256722.
  6. ^ Cook-Deegan R, Heaney C (2010-09-01). "Patents in genomics and human genetics". Annual Review of Genomics and Human Genetics. 11 (1): 383–425. doi:10.1146/annurev-genom-082509-141811. PMC 2935940. PMID 20590431.
  7. ^ Ziemert N, Alanjary M, Weber T (August 2016). "The evolution of genome mining in microbes - a review". Natural Product Reports. 33 (8): 988–1005. doi:10.1039/C6NP00025H. PMID 27272205.
  8. ^ Omura S, Ikeda H, Ishikawa J, Hanamoto A, Takahashi C, Shinose M, et al. (October 2001). "Genome sequence of an industrial microorganism Streptomyces avermitilis: deducing the ability of producing secondary metabolites". Proceedings of the National Academy of Sciences of the United States of America. 98 (21): 12215–12220. Bibcode:2001PNAS...9812215O. doi:10.1073/pnas.211433198. PMC 59794. PMID 11572948.
  9. ^ Tang X, Li J, Millán-Aguiñaga N, Zhang JJ, O'Neill EC, Ugalde JA, et al. (December 2015). "Identification of Thiotetronic Acid Antibiotic Biosynthetic Pathways by Target-directed Genome Mining". ACS Chemical Biology. 10 (12): 2841–2849. doi:10.1021/acschembio.5b00658. PMC 4758359. PMID 26458099.
  10. ^ a b Brandon MC, Wallace DC, Baldi P (July 2009). "Data structures and compression algorithms for genomic sequence data". Bioinformatics. 25 (14): 1731–1738. doi:10.1093/bioinformatics/btp319. PMC 2705231. PMID 19447783.
  11. ^ a b "AntiSMASH-DB".
  12. ^ Medema MH, Blin K, Cimermancic P, de Jager V, Zakrzewski P, Fischbach MA, et al. (July 2011). "antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences". Nucleic Acids Research. 39 (Web Server issue): W339–W346. doi:10.1093/nar/gkr466. PMC 3125804. PMID 21672958.
  13. ^ "PRISM". Adapsyn Bioscience.
  14. ^ Skinnider MA, Johnston CW, Gunabalasingam M, Merwin NJ, Kieliszek AM, MacLellan RJ, et al. (November 2020). "Comprehensive prediction of secondary metabolite structure and biological activity from microbial genome sequences". Nature Communications. 11 (1): 6058. Bibcode:2020NatCo..11.6058S. doi:10.1038/s41467-020-19986-1. PMC 7699628. PMID 33247171.
  15. ^ King RD, Wise PH, Clare A (May 2004). "Confirmation of data mining based predictions of protein function". Bioinformatics. 20 (7): 1110–1118. doi:10.1093/bioinformatics/bth047. PMID 14764546.
  16. ^ Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (October 1990). "Basic local alignment search tool". Journal of Molecular Biology. 215 (3): 403–410. doi:10.1016/S0022-2836(05)80360-2. PMID 2231712.
  17. ^ Medema MH, de Rond T, Moore BS (September 2021). "Mining genomes to illuminate the specialized chemistry of life". Nature Reviews. Genetics. 22 (9): 553–571. doi:10.1038/s41576-021-00363-7. PMC 8364890. PMID 34083778.
  18. ^ Rutledge PJ, Challis GL (August 2015). "Discovery of microbial natural products by activation of silent biosynthetic gene clusters". Nature Reviews. Microbiology. 13 (8): 509–523. doi:10.1038/nrmicro3496. PMID 26119570. S2CID 6474118.
  19. ^ Belknap KC, Park CJ, Barth BM, Andam CP (February 2020). "Genome mining of biosynthetic and chemotherapeutic gene clusters in Streptomyces bacteria". Scientific Reports. 10 (1): 2003. Bibcode:2020NatSR..10.2003B. doi:10.1038/s41598-020-58904-9. PMC 7005152. PMID 32029878.
  20. ^ Hoffmeister D, Keller NP (April 2007). "Natural products of filamentous fungi: enzymes, genes, and their regulation". Natural Product Reports. 24 (2): 393–416. doi:10.1039/B603084J. PMID 17390002.
  21. ^ Micallef ML, D'Agostino PM, Sharma D, Viswanathan R, Moffitt MC (September 2015). "Genome mining for natural product biosynthetic gene clusters in the Subsection V cyanobacteria". BMC Genomics. 16 (1): 669. doi:10.1186/s12864-015-1855-z. PMC 4558948. PMID 26335778.
  22. ^ Gomez-Escribano JP, Bibb MJ (February 2014). "Heterologous expression of natural product biosynthetic gene clusters in Streptomyces coelicolor: from genome mining to manipulation of biosynthetic pathways". Journal of Industrial Microbiology & Biotechnology. 41 (2): 425–431. doi:10.1007/s10295-013-1348-5. PMID 24096958. S2CID 15215660.
  23. ^ Sayers EW, Cavanaugh M, Clark K, Pruitt KD, Schoch CL, Sherry ST, Karsch-Mizrachi I (January 2021). "GenBank". Nucleic Acids Research. 49 (D1): D92–D96. doi:10.1093/nar/gkaa1023. PMC 7778897. PMID 33196830.
  24. ^ "IMG-ABC".
  25. ^ Palaniappan K, Chen IA, Chu K, Ratner A, Seshadri R, Kyrpides NC, et al. (January 2020). "IMG-ABC v.5.0: an update to the IMG/Atlas of Biosynthetic Gene Clusters Knowledgebase". Nucleic Acids Research. 48 (D1): D422–D430. doi:10.1093/nar/gkz932. PMC 7145673. PMID 31665416.
  26. ^ "BIG-FAM".
  27. ^ Kautsar SA, Blin K, Shaw S, Weber T, Medema MH (January 2021). "BiG-FAM: the biosynthetic gene cluster families database". Nucleic Acids Research. 49 (D1): D490–D497. doi:10.1093/nar/gkaa812. PMC 7778980. PMID 33010170.
  28. ^ "DoBISCUIT".
  29. ^ Ichikawa N, Sasagawa M, Yamamoto M, Komaki H, Yoshida Y, Yamazaki S, Fujita N (January 2013). "DoBISCUIT: a database of secondary metabolite biosynthetic gene clusters". Nucleic Acids Research. 41 (Database issue): D408–D414. doi:10.1093/nar/gks1177. PMC 3531092. PMID 23185043.
  30. ^ "MIBiG".
  31. ^ Kautsar SA, Blin K, Shaw S, Navarro-Muñoz JC, Terlouw BR, van der Hooft JJ, et al. (January 2020). "MIBiG 2.0: a repository for biosynthetic gene clusters of known function". Nucleic Acids Research. 48 (D1): D454–D458. doi:10.1093/nar/gkz882. PMC 7145714. PMID 31612915.
  32. ^ "iTOL".
  33. ^ Letunic I, Bork P (July 2016). "Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees". Nucleic Acids Research. 44 (W1): W242–W245. doi:10.1093/nar/gkw290. PMC 4987883. PMID 27095192.

Read other articles:

Ernest GrueningSenator Amerika Serikat dari AlaskaMasa jabatan3 Januari 1959 – 3 Januari 1969PendahuluKursi dibentukPenggantiMike GravelGubernur Teritorial Alaska ke-7Masa jabatan6 Desember 1939 – 10 April 1953WakilBob BartlettPendahuluJohn TroyPenggantiFrank Heintzleman Informasi pribadiLahirErnest Henry Gruening(1887-02-06)6 Februari 1887New York City, New York, Amerika SerikatMeninggal26 Juni 1974(1974-06-26) (umur 87)Washington, D.C., Amerika SerikatPartai polit...

 

Este artículo o sección tiene referencias, pero necesita más para complementar su verificabilidad.Este aviso fue puesto el 19 de septiembre de 2022. Celda unitaria definida por tres ejes cristalográficos a, b y c, y tres ángulos interaxiales α, β, γ. Un parámetro de red es una distancia o un ángulo que define geométricamente a la celda unitaria con respecto a su estructura cristalina. La celda unitaria queda definida por seis parámetros: la longitud de tres aristas a, b y c; y tres

 

Johnny Cash's Greatest Hits, Vol. 1grandes éxitos de Johnny CashPublicación Julio de 1967Género(s) CountryDuración 30:46Discográfica Columbia RecordsProductor(es) Don Law, Frank JonesCalificaciones profesionales Allmusic enlace Cronología de Johnny Cash Happiness is You(1966) Johnny Cash's Greatest Hits, Vol. 1 Carryin' on with Johnny Cash and June Carter(1967) [editar datos en Wikidata] Johnny Cash's Greatest Hits, Vol.1 es un álbum de recopilación de los mejores y m

American singer This biography of a living person needs additional citations for verification. Please help by adding reliable sources. Contentious material about living persons that is unsourced or poorly sourced must be removed immediately from the article and its talk page, especially if potentially libelous.Find sources: K.Flay – news · newspapers · books · scholar · JSTOR (May 2019) (Learn how and when to remove this template message) K.FlayK.Flay ...

 

Umareru.GenreKeluargaDitulis olehOsamu SuzukiSutradaraFuminori Kaneko, Akio Yoshida, Jun MutoPemeranMisako TanakaMaki HorikitaTadayoshi OkuraKento NakajimaSeika TaketomiRen OsugiPenggubah lagu temaYugo Kanno, Hirohide ShidaLagu penutup365 Nichi Kazoku oleh Kanjani8Negara asalJepangBahasa asliBahasa JepangJmlh. musim1Jmlh. episode10ProduksiProduserSanae SuzukiRilisJaringan asliTBSRilis asli22 April (2011-04-22) –24 Juni 2011 (2011-6-24)Pranala luar[Situs resmi Situs web] Umare...

 

Mexican drug cartel This article's lead section may be too long. Please read the length guidelines and help move details into the article's body. (June 2022) Jalisco CartelCártel Jalisco Nueva GeneraciónLogo of the Jalisco New Generation CartelFounded31 August 2009FounderNemesio Oseguera Cervantes, Emilio Alejandro Pulido Salazar, Martin Arzola Ortega, and Erick Valencia Salazar[1]Founding locationGuadalajara, Jalisco, México[2][3]Years active2009–presen...

Este artigo ou secção contém uma lista de referências no fim do texto, mas as suas fontes não são claras porque não são citadas no corpo do artigo, o que compromete a confiabilidade das informações. Ajude a melhorar este artigo inserindo citações no corpo do artigo. (Novembro de 2020) Kondo Isami na Batalha de Koshu-Katsunuma. Tosa Jinshotai (迅衝隊) (Da esquerda na linha inferior: Ban Gondayu, Itagaki Taisuke, Tani Otoi (jovem rapaz), Yamachi Motoharu. Da esquerda na linha do ...

 

Fachada del teatro (2006) El Teatro Tuschinski (en neerlandés, Theater Tuschinski) es un teatro situado en Ámsterdam. Empezó a utilizarlo el empresario polaco-neerlandés Abraham Icek Tuschinski, quien costeó su construcción por un precio de 4 millones de florines. Diseñado por Hijman Louis el joven, el teatro aúna diversos estilos artísticos: art déco, modernismo y escuela amsterdamesa. Actualmente, se suele emplear para estrenos de películas. La fachada tiene un estilo art decó c...

 

2005 single by DarinStep UpSingle by Darinfrom the album Darin Released7 September 2005Recorded2005GenrePop, R&B, DanceLength3:07LabelColumbia/Sony BMGSongwriter(s)RedOne, Darin, Bilal HajjiProducer(s)RedOneDarin singles chronology Why Does It Rain (2005) Step Up (2005) Who's That Girl (2005) Step Up is a song by Swedish singer Darin, released in September 2005 as the lead single from his second album, Darin. The song was co-written by Darin with RedOne and Bilal Hajji and debuted at numb...

Untuk kegunaan lain, lihat Rhinoplasti (disambiguasi). RhinoplastiIntervensiRhinoplasti: Kartilag lateral bawah (kartilag alar besar) diberi modifikasi plastik melalui nostril kiri.Pelafalan/ˈraɪnəˌplæsti/ ICD-9-CM21.87MeSHD012225MedlinePlus002983[sunting di Wikidata] Rhinoplasti (bahasa Yunani: ῥίς rhis, hidung + πλάσσειν plassein, membentuk), yang umumnya disebut sebagai pekerjaan hidung, adalah sebuah prosedur pembedahan plastik untuk membenarkan dan merekonst...

 

Artikel ini sebatang kara, artinya tidak ada artikel lain yang memiliki pranala balik ke halaman ini.Bantulah menambah pranala ke artikel ini dari artikel yang berhubungan atau coba peralatan pencari pranala.Tag ini diberikan pada Oktober 2016. Artikel ini memberikan informasi dasar tentang topik kesehatan. Informasi dalam artikel ini hanya boleh digunakan hanya untuk penjelasan ilmiah, bukan untuk diagnosis diri dan tidak dapat menggantikan diagnosis medis. Perhatian: Informasi dalam artikel...

 

1932 film A Star DisappearsDirected byRobert VillersWritten byMarcel AchardStarringSuzy Vernon Constant Rémy Alexandre DréanMusic byLionel CazauxProductioncompanyLes Studios ParamountDistributed byLes Films ParamountRelease date22 August 1932Running time74 minutesCountryFranceLanguageFrench A Star Disappears (French: Une étoile disparaît) is a 1932 French comedy film directed by Robert Villers and starring Suzy Vernon, Constant Rémy and Alexandre Dréan.[1] It was made at the Joi...

The MartianPoster film The MartianSutradara Ridley Scott Produser Simon Kinberg Ridley Scott Aditya Sood Michael Schaefer (produser) Mark Huffam Ditulis oleh Drew Goddard BerdasarkanThe Martianoleh Andy WeirPemeranMatt DamonJessica ChastainKristen WiigJeff DanielsMichael PeñaKate MaraSean BeanSebastian StanChiwetel EjioforPenata musikHarry Gregson-WilliamsSinematograferDariusz WolskiPenyuntingPietro ScaliaPerusahaanproduksiScott Free ProductionsKinberg GenreTSG EntertainmentDistributor...

 

Artikel ini sebatang kara, artinya tidak ada artikel lain yang memiliki pranala balik ke halaman ini.Bantulah menambah pranala ke artikel ini dari artikel yang berhubungan atau coba peralatan pencari pranala.Tag ini diberikan pada Maret 2016. SMP Negeri 6 BanjarmasinInformasiDidirikan1965AkreditasiANomor Statistik Sekolah201156003006Nomor Pokok Sekolah Nasional30304191Kepala SekolahAminsyah, M.PdJumlah kelas37 KelasRentang kelasVII, VIII, IXKurikulumKurikulum 2013Jumlah siswa±...

 

This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these template messages) This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Informalism – news · newspapers · books · scholar · JSTOR (February 2017) (Learn how and when to remove thi...

Hong Kong actor Peter ChanChan in March 2023BornChan Charm Man (1982-10-24) October 24, 1982 (age 41)Hong KongEducationHong Kong Academy for Performing Arts (BFA)OccupationActorYears active2018–presentSpouse Sung Man ​(m. 2019)​Children1 Peter Chan Charm Man (Chinese: 陳湛文, born October 24, 1982) is a Hong Kong actor best known for his debut role in Three Husbands, which earned him nominations for Best New Performer in the 38th Hong Kong Film ...

 

Алгоритм Бройдена - Флетчера - Гольдфарба - Шанно (англ. Broyden–Fletcher–Goldfarb–Shanno (BFGS)) - ітеративний метод числової оптимізації, призначений для знаходження локального максимуму / мінімуму нелінійної функції без обмежень (є спірними слова без обмежень, див. примітка). Даний мет...

 

This article relies largely or entirely on a single source. Relevant discussion may be found on the talk page. Please help improve this article by introducing citations to additional sources.Find sources: Arvid Posse – news · newspapers · books · scholar · JSTOR (October 2023) 2nd Prime Minister of Sweden (1820-1901) Not to be confused with Arvid Mauritz Posse. His Excellency High Well-born CountArvid PosseRoKavKMO StkNS:tOO LLA HedLÖSPortrait from de...

Book based on lectures by Vivekananda Lectures from Colombo to Almora Front cover of 1897 editionAuthorSwami VivekanandaCountryIndiaLanguageEnglishSubjectIndian philosophyPublisherVyjayanti Press, MadrasPublication date1897ISBN9788175050815OCLC276782395TextLectures from Colombo to Almora at Wikisource Lectures from Colombo to Almora (1897) is a book of Swami Vivekananda based on the lectures he delivered in Sri Lanka and India after his return from the West. Vivekananda reached Colombo, Briti...

 

1902 novel by Saki The Westminster Alice First edition cover of The Westminster AliceAuthorHector Hugh Munro (Saki)CountryUnited KingdomLanguageEnglishGenreFantasy novel, parodyPublished1902 (The Westminster Gazette)Media typePrint (hardback)Pagesxii, 152 The Westminster Alice is the name of a collection of vignettes written by Hector Hugh Munro (Saki) in 1902 and published by The Westminster Gazette of London. It is a political parody of Lewis Carroll's two books, Alice's Adventures in ...

 

Strategi Solo vs Squad di Free Fire: Cara Menang Mudah!