Diving into Genetics and Genomics: January 2015

Monday, January 26, 2015

Use Entrez Direct to access NCBI databases

I was reading lecture 3 of Applied Bioinformatics 2014 and learned that one can use Entrez Direct to access NCBI databases (PubMed, nucleotide, protein sequences, etc.).

After installing Entrez Direct, I played around with it:

Commonly-used fields for PubMed queries include:

  [AFFL]  Affiliation       [FILT]  Filter              [MESH]  MeSH Terms
  [ALL]   All Fields        [JOUR]  Journal             [PTYP]  Publication Type
  [AUTH]  Author            [LANG]  Language            [WORD]  Text Word
  [FAUT]  Author - First    [MAJR]  MeSH Major Topic    [TITL]  Title
  [LAUT]  Author - Last     [SUBH]  MeSH Subheading     [TIAB]  Title/Abstract
  [PDAT]  Date - Publication                            [UID]   UID

Filters that limit search results to subsets of PubMed include:

  humans [MESH]                has abstract [FILT]
  pharmacokinetics [MESH]      historical article [FILT]
  chemically induced [SUBH]    loprovflybase [FILT]
  all child [FILT]             randomized controlled trial [FILT]
  english [FILT]               clinical trial, phase ii [PTYP]
  free full text [FILT]        review [PTYP]
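
As a rough illustration of how the fields and filters above combine, here is a minimal sketch that counts matching PubMed records. The topic word "glioblastoma" is just a placeholder I picked; it assumes Entrez Direct is installed and that `esearch` reports the hit count in a `Count` element, which `xtract` then pulls out.

```shell
#!/bin/sh
# Placeholder topic (my own choice) combined with a field tag and two filters from the lists above.
QUERY='glioblastoma [TITL] AND review [PTYP] AND english [FILT]'

# Only run the search when Entrez Direct is on the PATH; it needs network access.
if command -v esearch >/dev/null 2>&1 && command -v xtract >/dev/null 2>&1; then
    # esearch prints a small XML summary of the search; xtract extracts the Count element.
    esearch -db pubmed -query "$QUERY" | xtract -pattern ENTREZ_DIRECT -element Count
else
    echo "Entrez Direct not found; would run: esearch -db pubmed -query \"$QUERY\""
fi
```

Swapping the `xtract` step for `efetch -format abstract` should print the abstracts instead of the count.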

Sequence databases are indexed with a different set of search fields, including:

  [ACCN]  Accession       [GENE]  Gene Name            [PROT]  Protein Name
  [ALL]   All Fields      [JOUR]  Journal              [SQID]  SeqID String
  [AUTH]  Author          [KYWD]  Keyword              [SLEN]  Sequence Length
  [GPRJ]  BioProject      [MLWT]  Molecular Weight     [SUBS]  Substance Name
  [ECNO]  EC/RN Number    [ORGN]  Organism             [WORD]  Text Word
  [FKEY]  Feature Key     [PACC]  Primary Accession    [TITL]  Title
  [FILT]  Filter          [PROP]  Properties           [UID]   UID

and a sample query in the protein database is:

  "alcohol dehydrogenase [PROT] NOT (bacteria [ORGN] OR fungi [ORGN])"
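
The sample query above can be passed to `esearch` and piped into `efetch` to download the matching sequences. This is only a sketch of how I would try it, assuming Entrez Direct is installed and on the PATH; `head` is there just to keep the output short.

```shell
#!/bin/sh
# The sample protein query from the text above.
QUERY='alcohol dehydrogenase [PROT] NOT (bacteria [ORGN] OR fungi [ORGN])'

# Only run the search when Entrez Direct is on the PATH; it needs network access.
if command -v esearch >/dev/null 2>&1 && command -v efetch >/dev/null 2>&1; then
    # esearch runs the query against the protein database,
    # efetch retrieves the hits in FASTA format, head shows the first 20 lines.
    esearch -db protein -query "$QUERY" | efetch -format fasta | head -n 20
else
    echo "Entrez Direct not found; would run: esearch -db protein -query \"$QUERY\" | efetch -format fasta"
fi
```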

Please refer to the documentation for more examples: http://www.ncbi.nlm.nih.gov/books/NBK179288/

Friday, January 23, 2015

Install Inkscape on Mac

It has been a while since I wrote my last post. I am now in China and the Internet connection is very bad... I will start my postdoc in Dr. Roel Verhaak's lab at MD Anderson Cancer Center. Dr. Verhaak's lab studies genomic alterations of brain tumors by analyzing whole exome sequencing, RNA-seq, whole genome sequencing, methylation and copy-number data. Yes, I am going to do a postdoc in computational biology. For sure, my computational skills will be strengthened in Dr. Verhaak's lab. Moreover, I will not give up my bench skills. I will use experiments to validate computational predictions.

I am writing a proposal and want to make a figure with Inkscape. It is very good drawing software for vector-based figures; for bitmap-based images, GIMP is the right tool. On a Linux machine (I have an Ubuntu machine), Inkscape can be installed with:

$ sudo apt-get install inkscape

I only have a Mac at hand now, so I have to install Inkscape on it. Installation on a Mac is different from that on Linux: Mac OS needs XQuartz to be installed first. I followed the instructions here http://stackoverflow.com/questions/21049815/mavericks-trying-to-install-xquartz-x11-for-inkscape-image-not-found and here https://www.youtube.com/watch?v=7kvSc_PYokM

And now it is ready to use! I am excited to start my new job at MD Anderson and I should have an exciting 2015. I will update more frequently on this blog as I learn new stuff in my new lab.
