Puig et al., 2023: bulk and single-cell RNAseq analysis scripts

This repository simply captures the data analysis scripts used for the paper in Rmd notebook format. The notebooks briefly discuss the rationale behind the approaches taken, and show the closest programmatical version to the figures in the paper. The pipeline has been written so that a relatively unskilled user can simply clone a 'lightweight' repository and run each script straight away (i.e. the scripts are intended to be platform-independent, take care of installing the necessary libraries, and all other resources are either included already or downloaded programmatically). As I (JdN) am relatively unskilled myself, and tests have only been done in RStudio using Mac OS X, some adaptation may be required - please feel free to raise an issue or contact me directly.

Description

Scripts #1 to #4 deal with the bulk RNAseq analysis of data generated by Jerome Korzelius and colleagues. This pipeline was written by Joaquín de Navascués, based on earlier work by Aleix Puig-Barbé.

Scripts #5 to #7 deal with the analysis of scRNAseq data already published by the Perrimon lab (Hung et al., 2020) and the Fly Cell Atlas consortium (Li et al., 2022). The pipeline was written by Vinícius Dias Nirello, and adapted for sharing by Joaquín de Navascués. Our pipeline uses data from Hung et al., (2020) whose integration was computed by the authors (and shared directly with us) from their data available at GEO. Because of the differences in C++ computing libraries and compilers working under the hood of R in different machines, this integrated data and their UMAP representation cannot be reproduced easily. Therefore we provide another script (integration_Hung2020, based on the scripts from that publication) that shows how the analysis could have been done purely from data deposited in public repositories, and pipe the results into script #5.

Note: This is a lightweight version of the scripts - no data are stored here, and instead they are automatically downloaded (and often deleted after loading); the only figures produced are for the notebooks (those for the data are saved in a different folder). However, you can obtain an archived version with the datasets and final figures from Zenodo:

.

Authors

Joaquín de Navascués @jdenavascues/ORCID
Vinícius Dias Nirello Google Scholar/GitHub
Aleix Puig-Barbé @AleixPuig7/ORCID/GitHub

Acknowledgments

Code snippets taken from Stack Overflow and other places are linked where they are used.

Work supported by:

funding from Cardiff University and the University of Essex
NC3Rs SKT grant NC/W001047/1
DFG Grant KO5594/1-1
an EMBO Long-Term Fellowship
FAPESP fellowship #2021/00393-9
FAPESP São Paulo Excellence Chair #2019/16113-5
ERC Advanced Grant no. 268515

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
figures		figures
input		input
notebook_figs		notebook_figs
output		output
resources		resources
.Rhistory		.Rhistory
.gitattributes		.gitattributes
.gitignore		.gitignore
1_differential_gene_expression.Rmd		1_differential_gene_expression.Rmd
1_differential_gene_expression.html		1_differential_gene_expression.html
2_dge_descriptive_viz.Rmd		2_dge_descriptive_viz.Rmd
2_dge_descriptive_viz.html		2_dge_descriptive_viz.html
3_dge_functional_class.Rmd		3_dge_functional_class.Rmd
3_dge_functional_class.html		3_dge_functional_class.html
4_dge_DNA_motifs.Rmd		4_dge_DNA_motifs.Rmd
4_dge_DNA_motifs.html		4_dge_DNA_motifs.html
5_SCdata_integration.Rmd		5_SCdata_integration.Rmd
5_SCdata_integration.html		5_SCdata_integration.html
6_SCpseudotime.Rmd		6_SCpseudotime.Rmd
6_SCpseudotime.html		6_SCpseudotime.html
7_SCexpression_plots.Rmd		7_SCexpression_plots.Rmd
7_SCexpression_plots.html		7_SCexpression_plots.html
Puigetal2023_bioinformatics_scripts.Rproj		Puigetal2023_bioinformatics_scripts.Rproj
README.md		README.md
doc.css		doc.css
integration_Hung2020.Rmd		integration_Hung2020.Rmd
integration_Hung2020.html		integration_Hung2020.html
utils.R		utils.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Puig et al., 2023: bulk and single-cell RNAseq analysis scripts

Description

Authors

Acknowledgments

About

Releases

Packages

Languages

jdenavascues/bHLH_code_midgut

Folders and files

Latest commit

History

Repository files navigation

Puig et al., 2023: bulk and single-cell RNAseq analysis scripts

Description

Authors

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages