Visualization and sharing of genomics data via a cloud based system

The Final Year Project, Visualization and Sharing of Genomics Data via a Cloud Based System, documented on the relationships between Cloud Computing, Next-Generation-Sequencing (NGS), Galaxy, Integrated Genome Browser (IGB) and UCSC Genome Browser. Due to the vast amount of Genomics data involved in...

Full description

Saved in:
Bibliographic Details
Main Author: Chen, Guohao
Other Authors: Zheng Jie
Format: Final Year Project
Language:English
Published: 2015
Subjects:
Online Access:http://hdl.handle.net/10356/62704
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-62704
record_format dspace
spelling sg-ntu-dr.10356-627042023-03-03T20:33:39Z Visualization and sharing of genomics data via a cloud based system Chen, Guohao Zheng Jie School of Computer Engineering Bioinformatics Research Centre DRNTU::Engineering::Computer science and engineering::Information systems The Final Year Project, Visualization and Sharing of Genomics Data via a Cloud Based System, documented on the relationships between Cloud Computing, Next-Generation-Sequencing (NGS), Galaxy, Integrated Genome Browser (IGB) and UCSC Genome Browser. Due to the vast amount of Genomics data involved in the renowned technology Next-Generation-Sequencing (NGS), Galaxy (An open source, web-based platform for data intensive biomedical research) adopted Cloud Computing as a potential methodology to remedy the storage, processing and sharing of data. A detailed guide from depositing data, installing of Galaxy to the hosting of Galaxy were included in this report with proper configurations and recommendations attached. It is important to note that Galaxy no longer supported the distribution of Windows platform and thus, Ubuntu (A community developed, GNU/Linux based Free/Open Source operating system) was adopted as a substitution for development in a Linux platform. Development on Galaxy was also made possible by leveraging on the API key generated by Galaxy where users could perform analysis on a Terminal instead. Galaxy was further migrated to existing Cloud infrastructure of Nanyang Technological University, School of Computer Engineering where users were able to take advantage of its high availability, performance capability and the privilege of enjoying scalability in the computing resources. Benchmarking was performed on a single workstation together with NTU-SCE Cloud services and the result shows the latter outperformed the former significantly. External web applications like UCSC Genome Browser and Integrated Genome Browser (IGB) were also introduced to enhanced users’ experience in performing data analysis. A total of three recommendations each for hosting Galaxy on the Cloud concluded that the trade-off for performance and availability comes with great financial cost. Bachelor of Engineering (Computer Science) 2015-04-27T08:34:34Z 2015-04-27T08:34:34Z 2015 2015 Final Year Project (FYP) http://hdl.handle.net/10356/62704 en Nanyang Technological University 85 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Information systems
spellingShingle DRNTU::Engineering::Computer science and engineering::Information systems
Chen, Guohao
Visualization and sharing of genomics data via a cloud based system
description The Final Year Project, Visualization and Sharing of Genomics Data via a Cloud Based System, documented on the relationships between Cloud Computing, Next-Generation-Sequencing (NGS), Galaxy, Integrated Genome Browser (IGB) and UCSC Genome Browser. Due to the vast amount of Genomics data involved in the renowned technology Next-Generation-Sequencing (NGS), Galaxy (An open source, web-based platform for data intensive biomedical research) adopted Cloud Computing as a potential methodology to remedy the storage, processing and sharing of data. A detailed guide from depositing data, installing of Galaxy to the hosting of Galaxy were included in this report with proper configurations and recommendations attached. It is important to note that Galaxy no longer supported the distribution of Windows platform and thus, Ubuntu (A community developed, GNU/Linux based Free/Open Source operating system) was adopted as a substitution for development in a Linux platform. Development on Galaxy was also made possible by leveraging on the API key generated by Galaxy where users could perform analysis on a Terminal instead. Galaxy was further migrated to existing Cloud infrastructure of Nanyang Technological University, School of Computer Engineering where users were able to take advantage of its high availability, performance capability and the privilege of enjoying scalability in the computing resources. Benchmarking was performed on a single workstation together with NTU-SCE Cloud services and the result shows the latter outperformed the former significantly. External web applications like UCSC Genome Browser and Integrated Genome Browser (IGB) were also introduced to enhanced users’ experience in performing data analysis. A total of three recommendations each for hosting Galaxy on the Cloud concluded that the trade-off for performance and availability comes with great financial cost.
author2 Zheng Jie
author_facet Zheng Jie
Chen, Guohao
format Final Year Project
author Chen, Guohao
author_sort Chen, Guohao
title Visualization and sharing of genomics data via a cloud based system
title_short Visualization and sharing of genomics data via a cloud based system
title_full Visualization and sharing of genomics data via a cloud based system
title_fullStr Visualization and sharing of genomics data via a cloud based system
title_full_unstemmed Visualization and sharing of genomics data via a cloud based system
title_sort visualization and sharing of genomics data via a cloud based system
publishDate 2015
url http://hdl.handle.net/10356/62704
_version_ 1759853671651213312