Accurate and Efficient Gene Function Prediction using a Multi-Bacterial Network
dc.contributor.author | Law, Jeffrey N. | en |
dc.contributor.author | Kale, Shiv D. | en |
dc.contributor.author | Murali, T. M. | en |
dc.date.accessioned | 2020-11-03T14:26:58Z | en |
dc.date.available | 2020-11-03T14:26:58Z | en |
dc.date.issued | 2019-05-24 | en |
dc.description.abstract | The rapid rise in newly sequenced genomes requires the development of computational methods to supplement experimental functional annotations. The challenge that arises is to develop methods for gene function prediction that integrate information for multiple species while also operating on a genomewide scale. We develop a label propagation algorithm called FastSinkSource and apply it to a sequence similarity network integrated with species-specific heterogeneous data for 19 pathogenic bacterial species. By using mathematically-provable bounds on the rate of progress of FastSinkSource during power iteration, we decrease the running time by a factor of 100 or more without sacrificing prediction accuracy. To demonstrate scalability, we expand to a 73-million edge network across 200 bacterial species while maintaining accuracy and efficiency improvements. Our results point to the feasibility and promise of multi-species, genomewide gene function prediction, especially as more experimental data and annotations become available for a diverse variety of organisms. | en |
dc.description.sponsorship | The research is based upon work supported by the Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (IARPA), via the Army Research Office (ARO) under cooperative Agreement Number [W911NF-17-2-0105]. | en |
dc.format.extent | 34 pages | en |
dc.format.mimetype | application/pdf | en |
dc.identifier.doi | https://doi.org/10.1101/646687 | en |
dc.identifier.uri | http://hdl.handle.net/10919/100773 | en |
dc.language.iso | en | en |
dc.rights | Creative Commons Attribution 4.0 International | en |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en |
dc.title | Accurate and Efficient Gene Function Prediction using a Multi-Bacterial Network | en |
dc.title.serial | Virginia Tech | en |
dc.type | Article | en |
dc.type.dcmitype | Text | en |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 646687v1.full.pdf
- Size:
- 4.44 MB
- Format:
- Adobe Portable Document Format
- Description: