Creative Commons License
This blog by Tommy Tang is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Friday, October 28, 2022

16 resources for re-analyzing public expression data.

1.  https://rnama.com/docs/search-evaluation RNA Meta Analysis has ~26,700 studies (5,717 RNA-Seq and 20,955 microarray).

2.  [refine.bio](https://www.refine.bio/) will have harmonized over 60,000 gene expression experiments

3.  BioJupies https://maayanlab.cloud/biojupies/

4.  [Recount2-FANTOM](https://www.biorxiv.org/content/10.1101/659490v1) Recounting the FANTOM CAGE-Associated Transcriptome. Focuses on long non-coding RNAs.

5.  Recount3 https://rna.recount.bio/

6.  [dee2](http://dee2.io/) Digital Expression Explorer 2 (DEE2) is a repository of uniformly processed RNA-seq data mined from public data in the NCBI Sequence Read Archive. By Mark Ziemann et al.; it is version 2 of DEE.

7.  Extracting allelic read counts from 250,000 human sequencing runs in Sequence Read Archive https://www.biorxiv.org/content/10.1101/386441v1?rss=1

8.  [MetaSRA: normalized sample-specific metadata for the Sequence Read Archive](http://biorxiv.org/content/early/2016/11/30/090506)

9.  [ARCHS4: Massive Mining of Publicly Available RNA-seq Data from Human and Mouse](https://amp.pharm.mssm.edu/archs4/) ARCHS4 provides access to gene counts from HiSeq 2000, HiSeq 2500 and NextSeq 500 platforms for human and mouse experiments from GEO and SRA.

10.  [DEP-reads: Uniformly processed public RNA-Seq data](http://bioinformatics.sdstate.edu/reads/) Read count data for 5,470 human and mouse datasets from ARCHS4 v6 and 12,670 datasets from DEE2 for 9 model organisms, by Steven Ge.

11.  [SRA-explorer](https://ewels.github.io/sra-explorer/) This tool aims to make datasets within the Sequence Read Archive more accessible. 

12.  [intropolis](https://github.com/nellore/intropolis) is a list of exon-exon junctions found across **21,504** human RNA-seq samples on the Sequence Read Archive (SRA) from spliced read alignment to hg19 with Rail-RNA.

13.  [Batch recompute of ~20,000 RNA-seq samples from large sequencing projects such as TCGA, TARGET and GTEx](https://genome-cancer.soe.ucsc.edu/proj/site/xena/datapages/?host=https://toil.xenahubs.net). Used `hg38` and `gencode v21` as annotation.

14.  [A cloud-based workflow to quantify transcript-expression levels in public cancer compendia](http://biorxiv.org/content/early/2016/07/12/063552) Used kallisto for the TCGA/CCLE datasets and GENCODE v24 as annotation.

15.  [MiPanda](http://www.mipanda.org/) is an online resource for the interrogation and visualization of gene expression data from the myriad of publicly available cancer and normal next-generation sequencing datasets.

16.   [Curation of over 10,000 transcriptomic studies to enable data reuse](https://www.biorxiv.org/content/10.1101/2020.07.13.201442v1)
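Several of the resources above hand you raw read counts, which usually need a library-size normalization before samples can be compared. Here is a minimal sketch of counts-per-million (CPM) with a log2 transform, written in plain Python. The gene names, the counts, and the function names are all made up for illustration; they do not come from any of the resources listed, and real analyses typically use dedicated packages instead.

```python
import math


def counts_per_million(counts):
    """Scale one sample's raw read counts so that they sum to one million."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample has no reads")
    return {gene: c * 1_000_000 / total for gene, c in counts.items()}


def log2_cpm(counts, pseudocount=1.0):
    """Return log2(CPM + pseudocount); the pseudocount avoids log2(0)."""
    cpm_values = counts_per_million(counts)
    return {gene: math.log2(v + pseudocount) for gene, v in cpm_values.items()}


# Hypothetical toy sample: gene -> raw read count (assumed values).
sample = {"GENE_A": 500, "GENE_B": 1500, "GENE_C": 0, "GENE_D": 8000}

cpm = counts_per_million(sample)
log_cpm = log2_cpm(sample)

for gene in sample:
    print(gene, cpm[gene], round(log_cpm[gene], 2))
```

CPM only corrects for sequencing depth. It does not account for gene length or composition bias, so treat it as a first look rather than a replacement for a proper normalization method.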

Tuesday, October 25, 2022

10 courses to get you started with bioinformatics

1/ http://rafalab.dfci.harvard.edu/pages/harvardx.html by Rafa

2/ https://github.com/quinlan-lab/applied-computational-genomics#course-lecture-slides by Aaron Quinlan, the creator of bedtools and many other cool tools.


3/ https://www.bioinformaticsalgorithms.org/ You can find the video lectures on Coursera.


4/ http://www.personal.psu.edu/iua1/courses/2014-BMMB-852.html by Istvan Albert, the creator of [biostars](https://www.biostars.org/).


5/ Introduction to Bioinformatics and Computational Biology https://liulab-dfci.github.io/bioinfo-combio/ by @XShirleyLiu. I am glad to have contributed a little myself.


6/ Data Carpentry workshops https://datacarpentry.org/lessons/#genomics-workshop I am honored to serve as the curriculum committee chair.


7/ Computational Genomics: Applied Comparative Genomics https://github.com/schatzlab/appliedgenomics2018

8/ Introduction to Computational Biology https://biodatascience.github.io/compbio/  by Mike Love  @mikelove


9/ [MIT Computational Biology: Genomes, Networks, Evolution, Health - Fall 2018 - 6.047/6.878/HST.507](https://www.youtube.com/playlist?list=PLypiXJdtIca6GBQwDTo4bIEDV8F4RcAgt) by Manolis Kellis


10/ An Introduction to Applied Bioinformatics http://readiab.org/introduction.html Very nice book with Python code.

Sunday, October 23, 2022

5 websites to analyze GEO RNAseq data without a single line of code


4. GREIN : GEO RNA-seq experiments interactive navigator for re-analyzing GEO RNA-seq data https://hub.docker.com/r/ucbd2k/grein/

5. ImaGEO: Integrative Meta-Analysis of GEO Data https://imageo.genyo.es/

Bonus: Expression Atlas https://www.ebi.ac.uk/gxa/home covers more than GEO.
One more: Gemma https://gemma.msl.ubc.ca/home.html

Thursday, October 20, 2022

12 websites to learn computation and many others!

1/ Coursera https://www.coursera.org/ The first website I used. I took the Data Science Specialization https://www.coursera.org/specializations/jhu-data-science and https://www.coursera.org/learn/bioinformatics

3/ Udacity https://www.udacity.com/ I took courses on R, ggplot2, GitHub and intro to ML.

4/ Udemy https://www.udemy.com/ I took several Python courses there.

5/ MIT OpenCourseWare https://ocw.mit.edu/ 18.06 Linear Algebra and many others!

7/ YouTube channel 3Blue1Brown https://www.youtube.com/channel/UCYO_jab_esuFRV4b17AJtAw will blow you away with crystal-clear explanations. I watched the linear algebra series.

10/ HubSpot https://academy.hubspot.com/ This one is new to me.

11/ EMBL-EBI training https://www.ebi.ac.uk/training/online/ bioinformatics courses

12/ SkillUp https://www.simplilearn.com/skillup-free-online-courses This is new to me as well.

Wednesday, October 19, 2022

12 web tools to explore genomics data


1. cBioPortal https://cbioportal.org explore genomic datasets at your fingertips
2. Xena https://xena.ucsc.edu, a UCSC effort. Everyone needs to learn how to use the UCSC Genome Browser https://genome.ucsc.edu
3. DepMap portal https://lnkd.in/et3uDeci Cancer Cell Line Encyclopedia
4. TCGA RNA fusion portal https://tumorfusions.org
5. https://lnkd.in/e3P7td-w
6. Tumor Immune Syngeneic MOuse (TISMO) database http://tismo.cistrome.org
7. PDX models https://lnkd.in/ezby9kns
8. https://lnkd.in/ev6EUkwf Tumor Immune Dysfunction and Exclusion (TIDE)
9. http://timer.cistrome.org TIMER is a comprehensive resource for systematic analysis of immune infiltrates across diverse cancer types
10. GenePattern https://genepattern.org
11. https://lnkd.in/ekFBdjfW
12. draw mutation for a protein https://lnkd.in/em8GmHQM