Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
November 15-16, 2012
The Genome Access Course is an intensive two-day introduction to bioinformatics. The course is broken into modules that are each designed to give a broad overview of a given topic, with ample time for examples chosen by the instructors. Each module features a brief lecture describing the theory, methods and tools followed by a set of worked examples that students complete. Students are encouraged to engage instructors during the course with specific tasks or problems that pertain to their own research. The core of the course is the analysis of sequence information framed in the context of completed genome sequences. Featured resources and examples primarily come from mammalian species, but concepts can be applied to any species. The course also features methods to assist the analysis and prioritization of gene lists from large scale microarray gene expression and proteomics experiments. Summaries of the topics are listed below. Sequence, Gene, and Protein Resources NCBI Sequence, Gene, and Protein Resources Model Organism Databases: Mouse Genome Informatics, Rat Genome Database, ZFIN, FlyBase Protein Sequence and Domain Resources: UniProt, PDB, InterPro Proteomics Resources: IPI, ExPaSY, PRIDE microRNA Resources: miRBase, microCosm Targets, TargetScan, PicTar Repositories of High-Throughput Sequence Data Repositories of Gene Expression Data: GEO, ArrayExpress Gene Expression Profiling Resources Gene Ontology Genome Browsers Genome Sequencing and Assembly Gene Annotation Overview and Comparison of Major Genome Browsers: Ensembl, UCSC, NCBI Map Viewer Adding custom tracks Bulk genome retrieval tools: BioMart, UCSC Table Browser de novo Analysis of Sequences Local, Global, Pairwise, and Multiple Sequence Alignments BLAST and BLAT Algorithms Scoring Matrices: PAM, BLOSUM Iterative Profile and Pattern Searches Multiple Sequence Alignment Programs Visualizing & Editing Multiple Alignments Sequence Variation Types of Sequence and Structural Variation SNP Resources: dbSNP Structural Variation Resources: dbVar, DGVa, HGVbase Comparative Genome Analysis and Functional Genomic Elements Finding Putative Regulatory Elements in Genome Sequence by Comparing Genomes Ortholog and Paralog Resources Multicontigview in Ensembl Comparative Tracks in the UCSC Genome Browser DCODE and ENCODE Resources Analysis of High-Throughput Sequence Data Common File Formats: FASTQ, SAM, BAM Quality Control and Diagnostic Analyses Mapping Reads To a Reference Sequence Finding Putative Mutations and Polymorphisms RNA-Seq Data Analysis ChIP-Seq Data Analysis de novo Assembly Galaxy Resources Gene Set Enrichment and Pathway Analysis Prioritizing Genes from Microarray and Proteomics Experiments Gene Set Enrichment Analysis Tools: GSEA, DAVID Pathway Resources: Reactome, HPRD NetPath, KEGG Protein Interaction Resources: MIPS, MINT, BIND, DIP The primary goal of The Genome Access Course is to provide a short, broad introduction to publicly available bioinformatics resources for genome analysis. For more detailed curriculum on algorithms used in computational biology, please see the Computational & Comparative Genomics course. Students interested in the practical aspects of software development are encouraged to apply to the course on Programming for Biology. Students who would like in-depth training in the analysis of next-generation sequencing data (e.g., SNP calling and the detection of structural variants) may be interested in the course on Advanced Sequencing Technologies & Applications.
|
|