NLP LLM Specialist at University of Wisconsin–Madison

Back to jobs
University of Wisconsin–Madison

NLP LLM Specialist

18h ago
Location
Remote
Type
Remote · Full-time
Compensation
$76k – 76k/yr
Skills
PythonNatural Language Processing (nlp)Large Language Models (llms)Generative AIHugging Face TransformersSpacyPyTorchTensorFlow+29

About this role

Current Employees: If you are currently employed at any of the Universities of Wisconsin, log in to Workday to apply through the internal application process. Job Category: Academic Staff Employment Type: Regular Job Profile: Data Scientist III Job Summary: The Large Language Model (LLM) / Natural Language Processing (NLP) Engineer will serve as a hands-on technical contributor responsible for building, integrating, and operationalizing advanced language-model capabilities within the Wisconsin Health Data Hub (WHDH) platform. WHDH is a federally funded initiative developing a secure, cloud-native data ecosystem designed to support biomedical research, advanced analytics, and AI-driven discovery using real-world health data. This role focuses on the practical implementation of NLP and generative AI technologies that enable scalable analysis of large volumes of unstructured healthcare data such as clinical notes, research publications, and other text-based datasets. The engineer will design and deploy production-grade AI services, integrate LLM capabilities into the WHDH platform, and support researchers and partner organizations in leveraging these tools for applied healthcare analytics. The position requires a strong engineering mindset and the ability to translate emerging AI capabilities into reliable, scalable solutions operating within a secure research data environment. Key Responsibilities LLM & NLP Engineering • Design, implement, and maintain production-ready NLP pipelines for processing large volumes of unstructured healthcare and biomedical text data. • Fine-tune, deploy, and optimize large language models for domain-specific applications including clinical text analysis, semantic search, and automated summarization. • Develop services for entity extraction, concept normalization, document classification, and information retrieval from healthcare datasets. • Build reusable NLP components and APIs that can be integrated into analytics workflows across the WHDH platform. Platform Integration & AI Services • Integrate LLM and NLP capabilities into WHDH’s cloud-based data and analytics platform. • Develop scalable APIs and microservices that enable secure access to language-model capabilities by research teams and application developers. • Implement containerized services and deployment pipelines to operationalize AI models in production environments. • Work with teams to ensure NLP pipelines operate efficiently within large-scale distributed data processing environments. Applied AI Solution Development • Collaborate with platform engineers and domain experts to design AI-driven solutions that address real-world healthcare data challenges. • Translate emerging LLM capabilities into practical tools for clinical text processing, data enrichment, and knowledge extraction. • Rapidly prototype and iterate AI-enabled features that improve usability and accessibility of the WHDH data platform. • Support applied analytics initiatives that leverage LLM capabilities to enhance research workflows. Security, Data Governance & Responsible AI • Ensure all AI solutions comply with institutional data governance policies and healthcare data privacy requirements. • Implement safeguards for secure handling of sensitive healthcare text data within NLP workflows. • Support responsible use of generative AI technologies through appropriate monitoring, evaluation, and documentation practices. • Collaborate with platform security teams to ensure compliance with HIPAA-aligned infrastructure requirements It is anticipated that this position will be remote and requires work be performed at an offsite, non-campus work location. The position requires the finalist to reside within the State of Wisconsin or relocate to Wisconsin within a reasonable time frame from the start date of the position. Schedule is flexible within business hours of Monday through Friday 8:00a.m.- 4:30p.m. Key Job Responsibilities: • Prepares data sets for analysis including cleaning/quality assurance, transformations, restructuring, and integration of multiple data sources • Serves as an institutional subject matter expert and liaison to key internal and external stakeholders regarding data science best practices and methodologies and represents the interests of data science • Composes and assembles reproducible workflows and reports to clearly articulate patterns to researchers and/or administrators • Leverage modern NLP frameworks and LLMs to extract critical insights from unstructured clinical notes and reports, ensuring data quality and integrity through rigorous preprocessing. • Develop predictive models using retrospective real-world data to estimate disease risk, progression, and treatment effectiveness, while addressing bias and fairness. Design and execute rigorous hypothesis testing on observational datasets to validate research findings • Work closely with data governance and security to ensure compliance with privacy regulations (e.g., NIST, HIPAA) when working with healthcare data; and address bias and fairness issues in AI models when dealing with sensitive health data • Develop and implement informatics pipelines for the processing, integration, and harmonization of heterogeneous data sources • Identifies and implements or guides others in implementing appropriate data science techniques to find data patterns and answer research questions chosen by the lead researcher including data visualization, statistical analysis, machine learning, and data mining • Organizes and automates project steps for data preparation and analysis • Documents approaches to address research questions and contributes to the establishment of reproducible research methodologies and analysis workflows Department: School of Medicine and Public Health, Informatics and Information Technology, Wisconsin Health Data Hub. The Wisconsin Health Data Hub (WHDH) is a grant-funded initiative within the Office of Informatics and Information Technology (IIT) at the University of Wisconsin–Madison School of Medicine and Public Health. WHDH brings together a multidisciplinary team of technologists responsible for designing, implementing, and operating a secure data enclave that supports the responsible use of real-world health data for biomedical research. The WHDH team develops and manages a scalable data platform that enables researchers to efficiently access, integrate, and analyze large-scale health datasets from participating health systems. By providing advanced data services, governance frameworks, and analytical capabilities, WHDH accelerates the research lifecycle—from project conception and data acquisition to analysis and discovery—while ensuring compliance with applicable regulatory, privacy, and security requirements. Compensation: The starting salary/hourly wage for the position is $76,289 annually; but is negotiable based on experience and qualifications. Employees in this position can expect to receive benefits such as generous vacation, holidays, and sick leave; competitive insurances and savings accounts; retirement benefits. For more information, refer to the campus benefits webpage. ☒ SMPH Faculty /Academic Staff Benefits Flyer 2026 Required Qualifications: • 3 Years of full-time professional experience building or deploying NLP or machine learning solutions in production environments. (5 years preferred) • Strong programming experience in Python and familiarity with modern NLP frameworks such as Hugging Face Transformers, spaCy, PyTorch, or TensorFlow. • Experience working with large-scale data processing pipelines and distributed data environments. • Experience deploying AI models using containerization technologies such as Docker and orchestration frameworks such as Kubernetes. • Ability to design and build scalable APIs and backend services supporting AI-powered applications. Preferred Qualifications: • Experience working with biomedical or clinical text data. • Familiarity with healthcare data models and standards such as FHIR, OMOP, or UMLS. • Experience developing AI solutions in cloud environments such as AWS, Azure, or Google Cloud. • Experience with MLOps practices including model deployment, monitoring, and lifecycle management. • Familiarity with vector databases, embedding models, and retrieval-augmented generation (RAG) architectures. • Experience building generative AI applications using modern LLM frameworks. Education: PhD Preferred; Focus in Computer Science, Software Engineering, Artificial Intelligence, Data Science, or a related technical field preferred. How to Apply: For the best experience completing your application, we recommend using Chrome or Firefox as your web browser. To apply for this position, select either “I am a current employee” or “I am not a current employee” under Apply Now. You will then be prompted to upload your application materials. Important: The application has only one attachment field. Upload the following documents in that field, either as a single combined file or as multiple files in the same upload area. • Cover letter required • Resume Your cover letter should address [how your training and experience aligns with the required and preferred qualifications listed above]. Application reviewers will rely on these written materials to determine which applicants move forward in the process. References will be requested from final candidates. All applicants will be notified once the search concludes and a candidate is selected University sponsorship is not available for this position, including transfers of sponsorship and TN visas. The selected applicant will be responsible for ensuring their continuous eligibility to work in the United States (i.e. a citizen or national of the United States, a lawful permanent resident, a foreign national authorized to work in the United States without the need of an employer sponsorship) on or before the effective date of appointment. This position is an ongoing position that will require continuous work eligibility. If you are selected for this position you must provide proof of work authorization and eligibility to work. The department will not be able to support a request for a J-1 waiver. If you choose to pursue a waiver and apply for our position, neither the UW nor UWMF will reimburse you for your legal or waiver fees. Contact Information: Cody Roekle, croekle@wisc.edu, 16082637676 Relay Access (WTRS): 7-1-1. See RELAY_SERVICE for further information. Institutional Statement on Diversity: Diversity is a source of strength, creativity, and innovation for UW-Madison. We value the contributions of each person and respect the profound ways their identity, culture, background, experience, status, abilities, and opinion enrich the university community. We commit ourselves to the pursuit of excellence in teaching, research, outreach, and diversity as inextricably linked goals. The University of Wisconsin-Madison fulfills its public mission by creating a welcoming and inclusive community for people from every background - people who as students, faculty, and staff serve Wisconsin and the world. The University of Wisconsin-Madison is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to, including but not limited to, race, color, religion, sex, sexual orientation, national origin, age, pregnancy, disability, or status as a protected veteran and other bases as defined by federal regulations and UW System policies. We promote excellence by acknowledging skills and expertise from all backgrounds and encourage all qualified individuals to apply. For more information regarding applicant and employee rights and to view federal and state required postings, visit the Human Resources Workplace Poster website. To request a disability or pregnancy-related accommodation for any step in the hiring process (e.g., application, interview, pre-employment testing, etc.), please contact the Divisional Disability Representative (DDR) in the division you are applying to. Please make your request as soon as possible to help the university respond most effectively to you. Employment may require a criminal background check. It may also require your references to answer questions regarding misconduct, including sexual violence and sexual harassment. The University of Wisconsin System will not reveal the identities of applicants who request confidentiality in writing, except that the identity of the successful candidate will be released. See Wis. Stat. sec. 19.36(7). The Annual Security and Fire Safety Report contains current campus safety and disciplinary policies, crime statistics for the previous 3 calendar years, and on-campus student housing fire safety policies and fire statistics for the previous 3 calendar years. UW-Madison will provide a paper copy upon request; please contact the University of Wisconsin Police Department.