Rediscovering Publicly Available Single-cell Data with the DISCO Platform

Since their inception in 2009, single-cell RNA-seq techniques have evolved, increasing throughput and reducing costs. With a growing number of published studies, efficient data retrieval is crucial. DISCO was created as a comprehensive database of single-cell RNA-seq data, enabling exploration of cell...

Bibliographic Details
Main Author: CHEN, Jinmiao
Format: text
Published: Institutional Knowledge at Singapore Management University 2024
Online Access: https://ink.library.smu.edu.sg/sgor2024/programme/schedule/13
https://ink.library.smu.edu.sg/context/sgor2024/article/1013/viewcontent/10_JinmiaoChen_V2.0.pdf
Institution: Singapore Management University
id sg-smu-ink.sgor2024-1013
record_format dspace
spelling sg-smu-ink.sgor2024-1013 2024-11-20T06:48:18Z Rediscovering Publicly Available Single-cell Data with the DISCO Platform CHEN, Jinmiao 2024-11-12T22:40:00Z text application/pdf Singapore Open Research Conference 2024 Institutional Knowledge at Singapore Management University
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
content_provider SMU Libraries
collection InK@SMU
description Since their inception in 2009, single-cell RNA-seq techniques have evolved, increasing throughput and reducing costs. With a growing number of published studies, efficient data retrieval is crucial. DISCO was created as a comprehensive database of single-cell RNA-seq data, enabling exploration of cell types and gene expression across various tissues. DISCO now hosts over 100 million single-cell profiles from 16,734 samples, a fivefold increase since its first version. We have curated metadata, categorized samples, and refined cell type annotations using a harmonized reference. The DISCO platform provides online tools for data integration, cell type annotation, projection of query datasets onto atlases, and gene set enrichment analysis. The DISCO R toolkit supports offline analyses. Our data also aids in training foundation AI models such as scGPT and scFoundation, enhancing hypothesis generation and data mining. DISCO’s continued updates and extensive dataset underscore its role as a key resource in the field.
format text
author CHEN, Jinmiao
spellingShingle CHEN, Jinmiao
Rediscovering Publicly Available Single-cell Data with the DISCO Platform
author_facet CHEN, Jinmiao
author_sort CHEN, Jinmiao
title Rediscovering Publicly Available Single-cell Data with the DISCO Platform
title_short Rediscovering Publicly Available Single-cell Data with the DISCO Platform
title_full Rediscovering Publicly Available Single-cell Data with the DISCO Platform
title_fullStr Rediscovering Publicly Available Single-cell Data with the DISCO Platform
title_full_unstemmed Rediscovering Publicly Available Single-cell Data with the DISCO Platform
title_sort rediscovering publicly available single-cell data with the disco platform
publisher Institutional Knowledge at Singapore Management University
publishDate 2024
url https://ink.library.smu.edu.sg/sgor2024/programme/schedule/13
https://ink.library.smu.edu.sg/context/sgor2024/article/1013/viewcontent/10_JinmiaoChen_V2.0.pdf
_version_ 1816859145119203328