A wet-dry hybrid biologist's take on genetics and genomics. Mostly is about Linux, R, python, reproducible research, open science and NGS. Grab my book to transform yourself to a computational biologist https://divingintogeneticsandgenomics.ck.page/
PEAKZILLA
---------
Peakzilla identifies sites of enrichment and transcription factor binding sites from transcription factor ChIP-seq and ChIP-exo experiments at hight accuracy and resolution. It is designed to perform equally well for data from any species. All necessary parameters are estimated from the data. Peakzilla is suitable for both single and paired end data from any sequencing platform.
Note that peakzilla is not suited for the identification of broad regions of enrichment (e.g. ChIP-seq for histone marks), we recommand using MACS instead: Zhang et al. Model-based Analysis of ChIP-Seq (MACS). Genome Biol (2008) 9(9):R137
3. Taverna http://www.taverna.org.uk/
I got to know it from a blog, never used it, but it looks promising
4. GenomeSpace http://www.genomespace.org/ it integrates many tools together including the above Galaxy and GenePattern
There are just way too many softwares out there for different kind of data analysis. First, get to know what are out there. Then, choose the right one meet your need for research.
update on 04/20/2016.
I noticed that this post is the most frequently visited one. it has been almost 3 years
since I wrote this post. Now, there are various tools for this purpose. See a
post on biostars.
well, I recently just went through the whole process for making a heatmap based on a ChIP-seq data set. If you do not know the technique, google it :) http://en.wikipedia.org/wiki/ChIP-sequencing
Often, you have a ChIP-seq data that are mapped to the reference genome ( a bam file). You want to plot the sequence tag intensity around certain features ( transcription start sites, gene body, enhancers, or any other genomic region you defined).
Seqmonk http://www.bioinformatics.babraham.ac.uk/projects/seqmonk/ from Simon Andrews can also plot this kind of figure very easily, I am just not satisfied with the picture quality, and I want more customized control of the picture.
I will just paste my code below, and it is heavily commented, you should be able to follow it fairly easily.
update on 05/05/2015, I put the code in a gist instead:
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
I hope you have learned something after reading it:)
===============================
update on 09/17/13
arrange the rows in the heatmap by the coverage from strong to weak
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
After being in the research field for a while, I found myself lost in a vast amount of pdf research papers.
I need a good pdf organizer.
especially when I start writing a paper, putting the reference become a headache to me.
That's where Zotero and Mendeley come to help.
A comparison for these two: https://www.youtube.com/watch?v=53QRA_SNSzE
http://www.zotero.org/
Zotero is open source, it can organize other files like images and videos besides pdf.
it teaches you how to start programming with python for biologists. I even bought the pdf ebook for $39. I had a look at the book, and I think it is worthy. I went through first 4 chapters really fast, they are really basic but teach biologists in a really friendly way. I am even thinking to hold a bioinformatics training course in the future :) sounds ambitious, hmm..
I've read the practical computing for biologists by Haddock Dunn http://www.amazon.com/Practical-Computing-Biologists-Steven-Haddock/dp/0878933913
it is a very good introductory book. Besides python, it also teaches you how to use linux command lines and some database basics. By the way, I wrote my first regular expression after reading the book.
well, I feel I just could get some practical problems ( text manipulation, Genomic Interval calculation using pybedtools, NGS by HTSeq) done by python though I began to play around with python in 04/24/2012. It takes time, but the investment definitely is rewarding!!
UCSC Archaeal Genome Browser: Provides you with many research and analysis tools that can be used to examine the genomes of more than 50 microbial species from the domain archaea.