Description
Project:
Transcription factor binding sites are important regulatory elements found upstream or downstream of a gene's transcription start site. These DNA-binding sites are non-exact, are often represented as a matrix of positional probabilities, and appear to have slightly different affinities across different ChIP-Seq assays. Here, we propose a framework to evaluate profiles from DNA-binding site collections (JASPAR, HocoMoco, UniPROBE, Jolma et al., TRANSFAC) against what is found in peaks called from ChIP-Seq assays.

The input is a position-weight matrix (PWM) representing the DNA profile for a given binding site of interest. The first part would automatically query the ENCODE project's API for experiments targeting the appropriate gene for the profile. The sequences at the respective peaks would then be extracted and scanned with the PWM. The goal is to find how well the PWM agrees with the experimental data. The output would be a summary of the profile's representation across sequences, along with statistics on the number of possible matches found per sequence.

Depending on which experiments are queried, further aims can include:
- Comparing profiles from alternative databases and versions to identify the most accurate representation for each experiment.
- Determining whether a database better represents binding sites for a given organism (mouse or human).
- Using the same approach to identify profiles for binding sites that were not targeted by the experiment but are frequently located in its peaks.
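The scanning and summary steps described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's implementation: the function names, the toy PWM, the uniform background of 0.25, the pseudocount, and the score threshold are all hypothetical choices, and peak sequences are assumed to have already been extracted as plain strings.

```python
import math
from typing import Dict, List, Sequence, Tuple

BASES = "ACGT"

# Hypothetical toy PWM (one dict of base probabilities per position),
# with consensus TGAC. Not taken from any real database.
TOY_PWM = [
    {"A": 0.01, "C": 0.01, "G": 0.01, "T": 0.97},
    {"A": 0.01, "C": 0.01, "G": 0.97, "T": 0.01},
    {"A": 0.97, "C": 0.01, "G": 0.01, "T": 0.01},
    {"A": 0.01, "C": 0.97, "G": 0.01, "T": 0.01},
]


def log_odds(pwm: Sequence[Dict[str, float]],
             background: float = 0.25,
             pseudocount: float = 1e-3) -> List[Dict[str, float]]:
    """Convert positional probabilities to log2 odds against a uniform background."""
    scored = []
    for column in pwm:
        total = sum(column[b] + pseudocount for b in BASES)
        scored.append({
            b: math.log2(((column[b] + pseudocount) / total) / background)
            for b in BASES
        })
    return scored


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA string; non-ACGT characters are left unchanged."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def scan(seq: str, lo: Sequence[Dict[str, float]],
         threshold: float) -> List[Tuple[int, str, float]]:
    """Return (offset, strand, score) for every window scoring >= threshold.

    Offsets on the "-" strand are relative to the reverse-complemented sequence.
    """
    width = len(lo)
    seq = seq.upper()
    hits = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for i in range(len(s) - width + 1):
            window = s[i:i + width]
            if any(ch not in BASES for ch in window):
                continue  # skip windows containing N or other ambiguity codes
            score = sum(lo[j][window[j]] for j in range(width))
            if score >= threshold:
                hits.append((i, strand, score))
    return hits


def summarize(peaks: Sequence[str], lo: Sequence[Dict[str, float]],
              threshold: float) -> Dict[str, object]:
    """Count matches per peak sequence and report simple summary statistics."""
    counts = [len(scan(p, lo, threshold)) for p in peaks]
    n = len(counts)
    return {
        "matches_per_sequence": counts,
        "sequences_with_match": sum(c > 0 for c in counts),
        "fraction_with_match": (sum(c > 0 for c in counts) / n) if n else 0.0,
        "mean_matches_per_sequence": (sum(counts) / n) if n else 0.0,
    }


if __name__ == "__main__":
    peaks = ["AATGACGG", "CCCCCCCC", "TGACTGAC"]  # made-up peak sequences
    print(summarize(peaks, log_odds(TOY_PWM), threshold=4.0))
```

Running `summarize` once per database profile over the same set of peaks would give comparable per-experiment numbers. A real evaluation would likely need a principled threshold (for example, one derived from a background score distribution) rather than the fixed value assumed here.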
Ideally, this project would take about 1.5-2.0 days of development, followed by 1-1.5 days of experimentation and attempts to answer questions using the project. Useful skills for this project include:
- Software development, scripting, object-oriented programming, REST APIs.
- Experience with transcription factor binding sites, motif discovery.
- Prior research with transcription factors and co-factor interactions.
Project Lead: Manuel Belmadani / @mbelmadani / Industry Professional / University of British Columbia