|
Open Positions We are always looking for talented individuals with a computational background to join us for an internship, rotation, Bachelor's, or Master's thesis. If you're interested in opportunities within our group, feel free to reach out via email! PhD position to decode the cis-regulatory code with deep learning We are looking for a highly motivated PhD candidate with a Master's degree in Computer Science, Computational Biology, Bioinformatics, or a related field to join our team! A strong background in computational biology and machine learning, particularly with PyTorch or similar deep learning frameworks, is essential. If you have experience in deep learning for genomics, this is an exciting opportunity to investigate the cis-regulatory syntax of the genome at both the transcriptional and translational levels. Your work will contribute to the interpretation of disease-associated non-coding variants and help uncover their mechanisms of action, advancing our understanding of gene regulation and its role in human health. If this sounds like you, we'd love to hear from you! Project: Uncovering Pioneer and Non-Pioneer TF Interactions via Deep Learning on Targeted Perturbation DataWe are offering an interdisciplinary thesis project that aims to decode how pioneer and non-pioneer transcription factors (TFs) shape chromatin accessibility and gene expression. The project leverages the Grand Lab's established degron cell lines to perform rapid, controlled TF degradation—capturing primary regulatory effects at unprecedented temporal resolution. These ATAC-seq, ChIP-seq, and RNA-seq datasets will be integrated in the Sasse Lab using advanced sequence-to-function deep learning approaches to predict differential chromatin accessibility, TF binding, and transcriptional output directly from DNA sequence. Qualifications
Project SummaryTranscription factors (TFs) and chromatin regulators are central to establishing and maintaining cellular identity by controlling gene expression through complex regulatory networks [1]. Pioneer TFs, such as BANP, GABPA, and ZFP143, are distinguished by their ability to remodel chromatin and open previously inaccessible regions, thereby enabling the binding of additional TFs [2]. In contrast, factors like YY1 and MYC lack pioneering activity. Our previous degron-based perturbation experiments show that pioneering activity is locus-specific and cannot be fully compensated by other pioneer TFs. Moreover, TF degradation reveals a striking diversity of regulatory outcomes: while most target loci lose accessibility and transcription, others exhibit opposite or even anticorrelated effects. These observations underscore that the interplay between sequence context, motif composition, and chromatin state in shaping transcriptional outcomes remains poorly understood. To clarify the underlying regulatory sequence syntax, this project will integrate existing ATAC-seq, ChIP-seq, and RNA-seq datasets from controlled degron cell lines to build a sequence-to-function (S2F) deep learning model [3,4]. This model will predict differential chromatin accessibility, TF binding, and gene expression between control and perturbed conditions using sequence features alone. Leveraging rapid, inducible TF depletion will enable the model to learn primary regulatory effects that are difficult to capture in steady-state systems. Together, these controlled genome-wide perturbations and computational approaches will allow systematic exploration of the combinatorial rules that govern pioneer and non-pioneer TF activity across diverse sequence contexts. The trained model will be used to test mechanistic hypotheses about regulatory sequence grammar and to design synthetic promoter elements for experimental validation. Using explainable AI methods, we will identify motifs and sequence features that enhance or inhibit TF-dependent chromatin remodeling [5]. In silico perturbations will allow us to evaluate how changes in motif arrangement, orientation, or co-occurrence influence chromatin accessibility and transcription. By clustering these results, we aim to derive generalizable principles governing pioneer activity and TF cooperativity. Ultimately, this project will generate a predictive framework for TF interactions in pluripotent stem cells (PSCs) and provide fundamental insights into how pioneer factors shape gene regulatory landscapes. More broadly, this approach, combining deep learning, motif-level analysis, and inducible genomic perturbations, offers a powerful strategy for decoding cis-regulatory logic. The resulting principles could guide stem cell engineering efforts by enabling rational design of regulatory elements to enhance PSC identity or modulate differentiation potential, with implications for regenerative medicine and disease modeling. References
1. Isbel, L., Grand, R., Schübeler, D. Nature Reviews Genetics (2022). Generating specificity in genome regulation through transcription factor sensitivity to chromatin https://doi.org/10.1038/s41576-022-00512-6
|
