Research Scientist, Natural Language Processing
Location Type: On-site
Job Number: 250000FN
Category: Research & Laboratory
Job Number: 250000FN
Category: Research & Laboratory
WHY UT SOUTHWESTERN?
With over 75 years of excellence in Dallas-Fort Worth, Texas, UT Southwestern is committed to excellence, innovation, teamwork, and compassion. As a world-renowned medical and research center, we strive to provide the best possible care, resources, and benefits for our valued employees. Ranked as the number 1 hospital in Dallas-Fort Worth according to U.S. News & World Report, we invest in you with opportunities for career growth and development to align with your future goals. Our highly competitive benefits package offers healthcare, PTO and paid holidays, on-site childcare, wage, merit increases and so much more. We invite you to be a part of the UT Southwestern team where you’ll discover a culture of teamwork, professionalism, and a rewarding career!
JOB SUMMARY
We are seeking a skilled and highly motivated full-time Machine Learning Research Scientist to join our innovative team in the Departments of Neurology and Bioinformatics. This role entails developing and applying machine learning and NLP techniques to analyze extensive audio datasets and transcribed text, aiming to identify predictive markers of dementia progression. The successful candidate will play a pivotal role in our groundbreaking research into dementia, working closely under the close mentorship of several faculty, including Prof. Ihab Hajjar, MD, an authority in dementia, and Prof. Albert Montillo, PhD, a renowned machine learning expert.
· Access to Large Multiyear Datasets
o You will have access to one of the most extensive multiyear datasets available in the field, providing a rich foundation for impactful research.
· High-Performance Computing Resources
o Leverage one of the largest high-performance compute clusters, facilitating advanced computational experiments and model training at an unprecedented scale.
· Advanced Machine Learning Development for audio analysis
o Design and implement algorithms to extract features from audio recordings that indicate dementia.
o Utilize cutting-edge machine learning techniques to analyze large-scale audio data for pattern recognition and feature extraction.
· Natural Language Processing (NLP) on Transcribed Text
o Employ NLP methods on transcribed speech to identify linguistic markers related to dementia progression.
o Develop models to correlate changes in speech patterns, vocabulary, and syntax with dementia stages.
· Predictive Modeling for Dementia Progression
o Design, train, and validate machine learning models using large datasets.
o Ensure models are robust, scalable, and accurately predict dementia progression scores.
· Collaboration and Research
o Work under the mentorship of Prof. Hajjar and Prof. Montillo, contributing to and leading interdisciplinary research initiatives.
o Collaborate with a team of experts in a dynamic, innovative research environment.
· Proficiency in NLP, with experience in text analysis, language modeling, and sentiment analysis. Experience with one or more modern NLP frameworks and tools (e.g., Transformers library, NLTK, SpaCy, BERT, AllenNLP, Gensim), is a strong plus.
· Expertise in programming languages such as Python, R, or Java, and familiarity with machine learning libraries (e.g., TensorFlow, PyTorch, Scikit-learn).
· Experience with data preprocessing, feature extraction, and model validation techniques.
· Experience in biomedical data analysis or working on healthcare-related machine learning projects.
· Familiarity with audio processing libraries (e.g., LibROSA, AudioKit, Kaldi, PyAudio, SpeechRecognition, Praat, Wav2Vec).
· PhD in Computer Science, Data Science, Engineering, Statistics, or a related field with a focus on machine learning.
· Strong background in machine learning, including supervised and unsupervised learning, and deep learning.
· Experience in biomedical data analysis or working on healthcare-related machine learning projects.
· Strong statistical analysis skills and experience working with large, complex datasets.
· Excellent problem-solving abilities and attention to detail.
· Knowledge of high-performance computing technologies (Linux) and batch scheduling and/or cloud computing for big data experiments (SLURM, Ray, Hadoop, Spark).
· Demonstrated ability to work in interdisciplinary teams and strong communication skills for conveying complex concepts to non-technical stakeholders.
· Publications in relevant scientific journals and a track record of successful research projects.
BENEFITS
UT Southwestern is proud to offer a competitive and comprehensive benefits package to eligible employees. Our benefits are designed to support your overall wellbeing, and include:
EXPERIENCE AND EDUCATION
Required
JOB DUTIES
SECURITY AND EEO STATEMENT
Security
This position is security-sensitive and subject to Texas Education Code 51.215, which authorizes UT Southwestern to obtain criminal history record information. To the extent this position requires the holder to research, work on, or have access to critical infrastructure as defined in Section 113.001(2) of the Texas Business and Commerce Code, the ability to maintain the security or integrity of the critical infrastructure is a minimum qualification to be hired and to continue to be employed in the position.
EEO Statement
UT Southwestern Medical Center is committed to an educational and working environment that provides equal opportunity to all members of the University community. As an equal opportunity employer, UT Southwestern prohibits unlawful discrimination, including discrimination on the basis of race, color, religion, national origin, sex, sexual orientation, gender identity, gender expression, age, disability, genetic information, citizenship status, or veteran status.
Explore the full range of opportunities we offer within our research areas and labs.
If you want a larger canvas for exploration and discovery, look no further. We have more than 500 labs on campus. Here are just a few of the research investigations you’ll find underway at UT Southwestern Medical Center:
Work here, and you’ll help us create and conduct translational bench-to-bedside research that quickly moves basic discoveries into treatments that directly benefit patients. Here’s an example: We’ve found that small artificial molecules called peptoids show promise as both diagnostic tools and treatments for various types of cancer. Peptoids can bind to cancerous cells more tightly than normal cells. So our researchers are looking at ways to combine peptoids with anti-cancer drugs to target cancer cells more specifically.
At UT Southwestern, you can be part of our next breakthrough.
As a progressive, innovation-driven medical center, UT Southwestern relies heavily on our lab techs, medical techs, histotechnologists, and cytotechnologists to play a key role in the diagnosis and care planning of patients.
Our broad spectrum of services will present you with opportunities to work across a range of dynamic lab settings. You’ll have abundant tools, training, and support to facilitate your success and professional growth.
For each member of this diverse and collaborative team, we provide excellent benefits, including PTO and pension and retirement plans. When you consider Dallas’ relatively low cost of living and high quality of life, UT Southwestern emerges as your best option for a fulfilling career.
There are many ways to contribute here. As we continue to explore new solutions that will save lives and enhance quality of life for millions, we’re committed to providing rigorous scientific training in both basic and clinical research to scholars across the Medical Center and community. You can take advantage or help support a number of robust programs, such as:
Fellowship Program (QP-SURF)
Our research teams help plan, conduct, fund, administer, and report on clinical trials across the broad spectrum of health conditions and diseases. More than 1,000 trials are currently underway, including these areas of medicine: