LdClusterView : a system for automated analysis and visualization of genomics data
In the study of genetics, researchers explore billions of deoxyribonucleic acid (DNA) bases to identify biologically interesting patterns. Due to the need to explore this voluminous data, bioinformatics scientists have developed genome browsers to provide researchers with a platform to better...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2018
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/73950 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-73950 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-739502023-03-03T20:40:23Z LdClusterView : a system for automated analysis and visualization of genomics data Salia, Sisi Zheng Jie School of Computer Science and Engineering A*STAR Singapore Immunology Network (SIgN) DRNTU::Engineering In the study of genetics, researchers explore billions of deoxyribonucleic acid (DNA) bases to identify biologically interesting patterns. Due to the need to explore this voluminous data, bioinformatics scientists have developed genome browsers to provide researchers with a platform to better understand the data. Similarly, in this project, Singapore Immunity Network (SIgN) aimed to develop an interactive web-based visualizations platform for the researchers. The visualizations created were LdClusterView, an improvement to the current genome browsers and Biostatistical Network Tool (BNT), a tool to identify interest genes for further analysis. Most of the genome browsers visualized the relationships between different biological layers through multiple graphical plots stacked on top of each other with a common horizontal axis representing the chromosome length. However, it only shows spatial relationship between different biological data at various regions of the chromosome and does not depict the complex relationship between genetic variations. LdClusterView extended the basic layout of stacked plots by incorporating a dendrogram and Sankey plot to describe the relationship between the stacked plots. These improvements allowed illustration of both relationships between the plots and relationships between the internal elements of the plots respectively. However, due to the limitation of the web application to view a large amount of data, only one gene could be displayed at a time. Therefore, another web application tool, BNT, was created to complement LdClusterView. BNT explored an emerging method of associating gene information with other types of biological data by analysing the data through non-parametric tests, plots and sub-network graph in form of Minimum Spanning Tree (MST) to identify interesting gene candidates for further exploration in LdClusterView. Both applications were implemented through HTML, CSS, JavaScript and D3 library. They were both optimized to be easily used by the researchers to explore the data and to produce visualizations for reporting purposes. Bachelor of Engineering (Computer Science) 2018-04-23T02:03:25Z 2018-04-23T02:03:25Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/73950 en Nanyang Technological University 82 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering |
spellingShingle |
DRNTU::Engineering Salia, Sisi LdClusterView : a system for automated analysis and visualization of genomics data |
description |
In the study of genetics, researchers explore billions of deoxyribonucleic acid (DNA) bases to
identify biologically interesting patterns. Due to the need to explore this voluminous data,
bioinformatics scientists have developed genome browsers to provide researchers with a
platform to better understand the data. Similarly, in this project, Singapore Immunity
Network (SIgN) aimed to develop an interactive web-based visualizations platform for the
researchers. The visualizations created were LdClusterView, an improvement to the current
genome browsers and Biostatistical Network Tool (BNT), a tool to identify interest genes for
further analysis.
Most of the genome browsers visualized the relationships between different biological layers
through multiple graphical plots stacked on top of each other with a common horizontal axis
representing the chromosome length. However, it only shows spatial relationship between
different biological data at various regions of the chromosome and does not depict the
complex relationship between genetic variations.
LdClusterView extended the basic layout of stacked plots by incorporating a dendrogram and
Sankey plot to describe the relationship between the stacked plots. These improvements
allowed illustration of both relationships between the plots and relationships between the
internal elements of the plots respectively. However, due to the limitation of the web
application to view a large amount of data, only one gene could be displayed at a time.
Therefore, another web application tool, BNT, was created to complement LdClusterView.
BNT explored an emerging method of associating gene information with other types of
biological data by analysing the data through non-parametric tests, plots and sub-network
graph in form of Minimum Spanning Tree (MST) to identify interesting gene candidates for
further exploration in LdClusterView.
Both applications were implemented through HTML, CSS, JavaScript and D3 library. They were
both optimized to be easily used by the researchers to explore the data and to produce
visualizations for reporting purposes. |
author2 |
Zheng Jie |
author_facet |
Zheng Jie Salia, Sisi |
format |
Final Year Project |
author |
Salia, Sisi |
author_sort |
Salia, Sisi |
title |
LdClusterView : a system for automated analysis and visualization of genomics data |
title_short |
LdClusterView : a system for automated analysis and visualization of genomics data |
title_full |
LdClusterView : a system for automated analysis and visualization of genomics data |
title_fullStr |
LdClusterView : a system for automated analysis and visualization of genomics data |
title_full_unstemmed |
LdClusterView : a system for automated analysis and visualization of genomics data |
title_sort |
ldclusterview : a system for automated analysis and visualization of genomics data |
publishDate |
2018 |
url |
http://hdl.handle.net/10356/73950 |
_version_ |
1759853460282408960 |