【在 g*********s 的大作中提到】 : think positively: at least now u know ur resume has spelling issues. : you should thank this interviewer who is careful and serious.
【在 K**4 的大作中提到】 : 不可行! : 可以做,但是false positive 太大,没有实际意义了 : 主要是现有mRNA 数据不完整,无法正确locate real transcription start site, : 习惯上人们要提取TSS 前500-4000 bp for promoter analysis, 但是实际的很可能在 : 50-100kb以外。
l*1
12 楼
if relative to stress response: you can try RAD-seq: Restriction site Associated DNA Sequencing http://www.molbio.uoregon.edu/facres/johnson.html HTTPS: //www.wiki.ed.ac.uk/display/RADSequencing/Home if relative to histone modification you can try BS-Seq: Bisulphite Sequencing http://seqanswers.com/wiki/BS-Seq original hint was from one Nature job posting: http://www.nature.com/naturejobs/science/jobs/344164-postdoctor >Postdoctoral Fellow in Evolutionary Bioinformatics : Vienna, Austria >A postdoctoral position in bioinformatics is immediately available in the research group of Ovidiu Paun at >the University of Vienna (see http://www.botanik.univie.ac.at/systematik/personnel/Paun.htm). below ignored > The candidate will play a lead role in analysing next generation sequencing data including RNA-seq, >smRNA-seq, BS-seq and RAD-seq. The fellow will be also involved in identifying outliers and performing > environmental correlations. >We are looking for a highly self-motivated and independent candidate, yet willing to work in a team->effort. The fellow should hold a relevant PhD degree in bioinformatics or related fields before starting this >position. Fluency in a major programming language such as perl or python and a strong publication >record are expected. The successful candidate should also be able to demonstrate experience with >computational analyses of high- throughput genomic data. >To be considered please send your application per email to ovidiu.paun’@‘ univie.ac.at including your CV, ........ >The latest preferred start >date is March 1st, 2014. or http://evol.mcmaster.ca/~brian/evoldir/PostDocs/Vienna.Evolutio http://evol.mcmaster.ca/cgi-bin/my_wrap/brian/evoldir/PostDocs/
Are those transcription factor binding site prediction softwares making sense? I mean if the chip data are not available, what can we do about the regulatory elements on the basis of the sequences?
makes NO sense at all in my perspective 你可以看到很多prediction的软件/网站;不同网站预测出来的结果完全不一样。 TF binding motif,我想都是非常variable的吧(http://en.wikipedia.org/wiki/Position-specific_scoring_matrix),当然我是外行,我想请教做TF binding的内行,到现在能准确identify比如MEF2A的binding site就一定是比如ATGGCC(我随便乱说的)? 但根据俺的经验,纵然MEF2A是exclusively的bind到ATGGCC;也不是说每个ATGGCC都一 定会被MEF2A target,一定还是要做实验的 现在我比较相信的是:加入chip-seq的数据表明TF会bind在某个基因的某个loci,然后 这个loci的某个SNP被软件预测可以改变binding motif;那么我相信这个SNP 会通过这 个TF binding调控基因的表达
【在 c***y 的大作中提到】 : Are those transcription factor binding site prediction softwares making : sense? I mean if the chip data are not available, what can we do about the : regulatory elements on the basis of the sequences? : : binding
非植物 或哺乳的数据库区别也 植物的也 一样在完善 关键是楼主有无NGS and HMM (Hidden Markov Models 的背景或能找到那种背景的 合作人 pls refer PMID 23435661 by Van der Does D et al., (2013). Salicylic acid suppresses jasmonic acid signaling downstream of SCFCOI1-JAZ by targeting GCC promoter motifs via transcription factor ORA59. Plant Cell. 25: 744-61. Abstract: ignored In silico promoter analysis of the SA/JA crosstalk transcriptome revealed that the 1-kb promoter regions of JA-responsive genes that are suppressed by SA are significantly enriched in the JA-responsive GCC-box motifs below ignored too http://www.ncbi.nlm.nih.gov/pubmed/23435661 full pdf link: HTTP double dot //www.plantcell.org/content/25/2/744.full.pdf or Wong KC et al., (2013). DNA motif elucidation using belief propagation. Nucleic Acids Res. 41: e153. http://www.ncbi.nlm.nih.gov/pubmed/23814189 Abstract Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 ~10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. below ignored http://www.cs.toronto.edu/~wkc/kmerHMM/downloads.html or http://www.cs.utoronto.ca/~wkc/ http://www.utoronto.ca/zhanglab/people.html
pls refer fresh new both papers: a, by Morozov VY and Ioshikhes IP. (2013). Optimized Position Weight Matrices in Prediction of Novel Putative Binding Sites for Transcription Factors in the Drosophila melanogaster Genome. PLoS One. 8: e68712. Abstract Position weight matrices (PWMs) have become a tool of choice for the identification of transcription factor binding sites in DNA sequences. below ignored In the present study, we extended this technique originally tested on single examples of transcription factors (TFs) and showed its capability to optimize PWM performance to predict new binding sites in the fruit fly genome. We propose refined PWMs in mono- and dinucleotide versions similarly computed for a large variety of transcription factors of Drosophila melanogaster. Along with the addition of many auxiliary sites the optimization includes variation of the PWM motif length, the binding sites location on the promoters and the PWM score threshold. To assess the predictive performance of the refined PWMs we compared them to conventional TRANSFAC and JASPAR sources. below ignored http://www.ncbi.nlm.nih.gov/pubmed/23936309 or b, by Radivojac P et al., (2013). A large-scale evaluation of computational protein function prediction. Nat Methods. 10: 221-7. Abstract above ignored Fifty-four methods representing the state of the art for protein function prediction were evaluated on a target set of 866 proteins from 11 organisms. Two findings stand out: (i) today's best protein function prediction algorithms substantially outperform widely used first-generation methods, with large gains on all types of targets; and (ii) although the top methods perform well enough to guide experiments, there is considerable need for improvement of currently available tools. http://www.ncbi.nlm.nih.gov/pubmed/23353650
【在 c***y 的大作中提到】 : Are those transcription factor binding site prediction softwares making : sense? I mean if the chip data are not available, what can we do about the : regulatory elements on the basis of the sequences? : : binding
l*1
20 楼
Pls refer one review By Madrigal P and Krajewski P. (2012). Current bioinformatic approaches to identify DNase I hypersensitive sites and genomic footprints from DNase-seq data. Front Genet. 3:230. cited from its pp2: >With sufficiently deepsequencing,the >so-called“digital genomic footprinting” >technique can reveal single protein- >binding events(Hesselberth et al.,2009). >Unlike ChIP-seq,which is specific for the >protein under study,footprints identify >narrow DNA regions that can be bound >by any factor(Hager,2009),showing sig- >nificant enrichment for known motifs >upstream of the transcription start sites >(TSSs). http://www.ncbi.nlm.nih.gov/pubmed/23118738 or Zhang W et al., (2012), Genome-wide identification of regulatory DNA elements and protein-binding footprints using signatures of open chromatin in Arabidopsis. Plant Cell. 24: 2719-31. http://www.ncbi.nlm.nih.gov/pubmed/22773751