Dr Nathan Skene
London, UK
Position Details
Apply by Jan 31, 2021
PhD
London, UK
Application deadline
Jan 31, 2021
Job
Job location
London, UK
Quick Information
Application Deadline
Jan 31, 2021
Start Date
Flexible
Education Required
See description
Experience Level
—
Job
Job location
London, UK
Job Description
Using machine learning to predict cell-type specific effects of genetic variants which influence genome regulation.
This PhD project is focused on using machine learning techniques to develop novel classifiers for predicting how changes in DNA sequences alter genomic regulatory features. Many regulatory proteins recognise particular DNA sequences known as motifs, for instance, EcoRI only binds to GAATTC. DNA sequences can be converted into a machine interpretable format, using one-hot encoding. The candidate will use publicly available and inhouse datasets of genomic regulatory features to train models. Machine learning techniques will be used to predict the cell-type specific regulatory effects of genetic variants. We will provide several true-positive datasets, wherein the effect of genetic mutations on particular regulatory features has been measured. These will form validation datasets to evaluate how well the trained classifier works. We are interested in how improvements in the machine learning approach (e.g. use of transfer learning, recurrent attentional networks or graph convolution networks) can be used to improve upon existing methods. The candidate will use these techniques to identify causal pathways and candidate drug targets for neurodegenerative diseases.
This PhD project is focused on using machine learning techniques to develop novel classifiers for predicting how changes in DNA sequences alter genomic regulatory features. Many regulatory proteins recognise particular DNA sequences known as motifs, for instance, EcoRI only binds to GAATTC. DNA sequences can be converted into a machine interpretable format, using one-hot encoding. The candidate will use publicly available and inhouse datasets of genomic regulatory features to train models. Machine learning techniques will be used to predict the cell-type specific regulatory effects of genetic variants. We will provide several true-positive datasets, wherein the effect of genetic mutations on particular regulatory features has been measured. These will form validation datasets to evaluate how well the trained classifier works. We are interested in how improvements in the machine learning approach (e.g. use of transfer learning, recurrent attentional networks or graph convolution networks) can be used to improve upon existing methods. The candidate will use these techniques to identify causal pathways and candidate drug targets for neurodegenerative diseases.
Requirements
- Applicants must hold (or obtain by October 2020) a First Class or an Upper Second Class degree (or equivalent overseas qualification) in a quantitative discipline
- such as mathematics
- statistics
- computer science or engineering. Imperial would normally expect successful applicants to hold or achieve a Master's degree in a related field.
- Prior experience with programming is essential
- but no experience with biology is necessary. Experience using machine learning methods will be beneficial. Ideally the candidate will have experience using R and version control.
Related Domains
Job
Job location
Coordinates pending.
London, UK
Related Seminars
Seminar60%
SSFN Webinar - Depression and antidepressants
neuro
Apr 27, 2025
Sahlgrenska Academy
Seminar60%
Reading Scenes
neuro
Apr 28, 2025
Ludwig-Maximilians-Universität München
Seminar60%
Unlocking the Secrets of Microglia in Neurodegenerative diseases: Mechanisms of resilience to AD pathologies
neuro
Apr 30, 2025
UC Irvince