PhDApplications Closed
Dr Nathan Skene
Unknown Organization
London, UK
Apply by Jan 31, 2021
Application deadline
Jan 31, 2021
Job
Job location
Dr Nathan Skene
London, UK
Geocoding in progress.
Source: legacy
Quick Information
Application Deadline
Jan 31, 2021
Start Date
Flexible
Education Required
See description
Experience Level
Not specified
Job
Job location
Dr Nathan Skene
Job Description
Using machine learning to predict cell-type specific effects of genetic variants which influence genome regulation.
This PhD project is focused on using machine learning techniques to develop novel classifiers for predicting how changes in DNA sequences alter genomic regulatory features. Many regulatory proteins recognise particular DNA sequences known as motifs, for instance, EcoRI only binds to GAATTC. DNA sequences can be converted into a machine interpretable format, using one-hot encoding. The candidate will use publicly available and inhouse datasets of genomic regulatory features to train models. Machine learning techniques will be used to predict the cell-type specific regulatory effects of genetic variants. We will provide several true-positive datasets, wherein the effect of genetic mutations on particular regulatory features has been measured. These will form validation datasets to evaluate how well the trained classifier works. We are interested in how improvements in the machine learning approach (e.g. use of transfer learning, recurrent attentional networks or graph convolution networks) can be used to improve upon existing methods. The candidate will use these techniques to identify causal pathways and candidate drug targets for neurodegenerative diseases.
This PhD project is focused on using machine learning techniques to develop novel classifiers for predicting how changes in DNA sequences alter genomic regulatory features. Many regulatory proteins recognise particular DNA sequences known as motifs, for instance, EcoRI only binds to GAATTC. DNA sequences can be converted into a machine interpretable format, using one-hot encoding. The candidate will use publicly available and inhouse datasets of genomic regulatory features to train models. Machine learning techniques will be used to predict the cell-type specific regulatory effects of genetic variants. We will provide several true-positive datasets, wherein the effect of genetic mutations on particular regulatory features has been measured. These will form validation datasets to evaluate how well the trained classifier works. We are interested in how improvements in the machine learning approach (e.g. use of transfer learning, recurrent attentional networks or graph convolution networks) can be used to improve upon existing methods. The candidate will use these techniques to identify causal pathways and candidate drug targets for neurodegenerative diseases.
Requirements
- Applicants must hold (or obtain by October 2020) a First Class or an Upper Second Class degree (or equivalent overseas qualification) in a quantitative discipline
- such as mathematics
- statistics
- computer science or engineering. Imperial would normally expect successful applicants to hold or achieve a Master's degree in a related field.
- Prior experience with programming is essential
- but no experience with biology is necessary. Experience using machine learning methods will be beneficial. Ideally the candidate will have experience using R and version control.
Related Domains
Job
Job location
Dr Nathan Skene
Coordinates pending.
About Unknown Organization
Learn more about this opportunity with Unknown Organization.
Related Seminars
Seminar60%
Pancreatic Opioids Regulate Ingestive and Metabolic Phenotypes
neuro
Jan 12, 2025
Washington University in St. Louis
Seminar60%
The Role of GPCR Family Mrgprs in Itch, Pain, and Innate Immunity
neuro
Jan 12, 2025
Johns Hopkins University
Seminar60%
Exploration and Exploitation in Human Joint Decisions
neuro
Jan 12, 2025
Munich