Probabilistic peak calling and controlling false discovery rate estimations in transcription factor binding site mapping from ChIP-seq.
Publication Type:
Journal ArticleSource:
Methods in molecular biology (Clifton, N.J.), Volume 674, p.161-77 (2010)Keywords:
2010, Binding Sites, Center-Authored Paper, Chromatin Immunoprecipitation, False Positive Reactions, GA-Binding Protein Transcription Factor, Humans, Internet, Jurkat Cells, Probability, Public Health Sciences Division, Regulatory Sequences, Nucleic Acid, Reproducibility of Results, Sequence Analysis, DNA, Software, TRANSCRIPTION FACTORSAbstract:
Localizing the binding sites of regulatory proteins is becoming increasingly feasible and accurate. This is due to dramatic progress not only in chromatin immunoprecipitation combined by next-generation sequencing (ChIP-seq) but also in advanced statistical analyses. A fundamental issue, however, is the alarming number of false positive predictions. This problem can be remedied by improved peak calling methods of twin peaks, one at each strand of the DNA, kernel density estimators, and false discovery rate estimations based on control libraries. Predictions are filtered by de novo motif discovery in the peak environments. These methods have been implemented in, among others, Valouev et al.'s Quantitative Enrichment of Sequence Tags (QuEST) software tool. We demonstrate the prediction of the human growth-associated binding protein (GABPalpha) based on ChIP-seq observations.
Doing Business with Arnold Library
Weintraub Building, B1-010
(206) 667-4314
library@fredhutch.org
More About Arnold Library
Quick Links
- ILLiad/ILL Form
- Journals List
- Ovid - Medline
- Web of Science
- Pubmed
- Libguides
- Library Catalog
- Researcher Profiles
- Library Affiliations & Memberships
- Fred Hutch Papers (Intranet)