Assistant Data Scientist
The mission of The University of Texas MD Anderson Cancer Center is to eliminate cancer in Texas, the nation, and the world through outstanding programs that integrate patient care, research and prevention, and through education for undergraduate and graduate students, trainees, professionals, employees and the public.
The primary purpose of the Assistant Data Scientist is to assist in the analysis and development of algorithms and software for clinical, operational, and scientific problems. This position impacts clinical, operational, and scientific research projects.
JOB SPECIFIC COMPETENCIES
Analyze Next-generation Sequencing Data and Deploy Immunogenomic Tools:
Run somatic mutation, copy number and transcriptional alteration pipelines on matched tumor/normal pairs whole exome/genome and RNA-seq data.
Run clonality estimation and mutational phylogeny inference tools.
Perform single-point and serial analysis of single-cell RNA-seq, ATAC-seq, CITE-seq and TCR/BCR-seq data.
Create pipelines comprising a wide-ranging suite of in-house and external immune deconvolution, HLA inference, neoantigen prediction, T- and B-cell repertoire analysis software.
Write custom scripts to improve immunogenic methods developed in the lab. Package tools using Docker and Conda applications.
Assisting & Teaching Others:
Assists in the analysis and development of formalisms, algorithms and software.
With direction, develops and maintains software tools and infrastructure for resolving specific problems.
Maintains pipelines for analysis of sequencing data.
Contributes to the development of procedures and makes recommendations on new procedures; contributes to intellectual atmosphere through journal clubs, sharing of ideas and constructive feedback on others' presentations, and collaboration.
Support Ongoing Research Projects:
Provide bioinformatic and basic statistical support to ongoing research projects including differential gene expression/mutation analysis, gene set enrichment and pathway analysis, custom sequence alignments etc.
Download and analyze publicly available datasets such as TCGA, CCLE, GTEx and GEO.
Generate custom visualizations of analytical results.
Organize data and document and code developed for each project
Other duties as assigned.
Education Required: Bachelor's degree in Biomedical Engineering, Electrical Engineering, Computer Engineering, Physics, Applied Mathematics, Statistics, Computer Science, Computational Biology, or related field.
Education Preferred: Master's degree in a related field.
Experience Required: None
It is the policy of The University of Texas MD Anderson Cancer Center to provide equal employment opportunity without regard to race, color, religion, age, national origin, sex, gender, sexual orientation, gender identity/expression, disability, protected veteran status, genetic information, or any other basis protected by institutional policy or by federal, state or local laws unless such distinction is required by law. http://www.mdanderson.org/about-us/legal-and-policy/legal-statements/eeo-affirmative-action.html