Resurrecting the anscombosaurus Rex : why learning exploratory data analysis is critical for biologists
Confirmatory Data Analysis (CDA) methods, such as p-values for significance testing, are increasingly abused in research. We recommend Exploratory Data Analysis (EDA) as a complement to CDA methods for gaining better insight into data. This study aims to design an introductor...
Main Author: | Ho, Sung Yang |
---|---|
Other Authors: | Goh Wen Bin Wilson |
Format: | Final Year Project |
Language: | English |
Published: | 2019 |
Subjects: | DRNTU::Science::Biological sciences |
Online Access: | http://hdl.handle.net/10356/77284 |
Institution: | Nanyang Technological University |
id | sg-ntu-dr.10356-77284 |
---|---|
record_format | dspace |
spelling | sg-ntu-dr.10356-77284 2023-02-28T18:05:58Z Resurrecting the anscombosaurus Rex : why learning exploratory data analysis is critical for biologists Ho, Sung Yang Goh Wen Bin Wilson School of Biological Sciences DRNTU::Science::Biological sciences Confirmatory Data Analysis (CDA) methods, such as p-values for significance testing, are increasingly abused in research. We recommend Exploratory Data Analysis (EDA) as a complement to CDA methods for gaining better insight into data. This study aims to design an introductory EDA tutorial and to use feedback from participants, together with current materials taught in online courses, to design EDA subtopics that can be used in biology and, if possible, generalized to other fields. Our findings suggest that there is a knowledge gap between undergraduates and postgraduates. Students are exposed only to CDA methods, and they hold multiple misconceptions about graphical and statistical interpretation. The key findings suggest that biology students are not only insufficiently proficient in statistics but also lacking in data science skills. Hence, there is a pressing need for better data science education in biology. The final design of the EDA subtopics, arrived at after content analysis, aims to teach students the importance of clean data, the power of data visualisation using the “ggplot2” R package, and patterns of significance when analysing data. Bachelor of Science in Biological Sciences 2019-05-24T01:06:28Z 2019-05-24T01:06:28Z 2019 Final Year Project (FYP) http://hdl.handle.net/10356/77284 en Nanyang Technological University 57 p. application/pdf |
institution | Nanyang Technological University |
building | NTU Library |
continent | Asia |
country | Singapore |
content_provider | NTU Library |
collection | DR-NTU |
language | English |
topic | DRNTU::Science::Biological sciences |
description | Confirmatory Data Analysis (CDA) methods, such as p-values for significance testing, are increasingly abused in research. We recommend Exploratory Data Analysis (EDA) as a complement to CDA methods for gaining better insight into data. This study aims to design an introductory EDA tutorial and to use feedback from participants, together with current materials taught in online courses, to design EDA subtopics that can be used in biology and, if possible, generalized to other fields. Our findings suggest that there is a knowledge gap between undergraduates and postgraduates. Students are exposed only to CDA methods, and they hold multiple misconceptions about graphical and statistical interpretation. The key findings suggest that biology students are not only insufficiently proficient in statistics but also lacking in data science skills. Hence, there is a pressing need for better data science education in biology. The final design of the EDA subtopics, arrived at after content analysis, aims to teach students the importance of clean data, the power of data visualisation using the “ggplot2” R package, and patterns of significance when analysing data. |
author2 | Goh Wen Bin Wilson |
author_facet | Goh Wen Bin Wilson; Ho, Sung Yang |
format | Final Year Project |
author | Ho, Sung Yang |
author_sort | Ho, Sung Yang |
title | Resurrecting the anscombosaurus Rex : why learning exploratory data analysis is critical for biologists |
publishDate | 2019 |
url | http://hdl.handle.net/10356/77284 |
_version_ | 1759856907858739200 |