Job Title: Machine Learning Engineer – AI Model Post-Training and Inference
Position Type: Full-time
Location: Remote / Hybrid / Onsite
Team: Engineering & Research
Salary Range: $200k-300k Base+ bonus
Job Description:
As a Machine Learning Engineer on the post-training and inference team at IntelliPro, you will bridge state-of-the-art AI research with production systems. You will develop pipelines that synthesize data, fine-tune and align models, compress large models, and deploy them at scale across our infrastructure. This hands-on engineering role demands both deep technical expertise in generative AI and strong software-engineering skills to deliver performant, reliable, and secure services
Key Responsibilities:
• Develop post-training pipelines: Build and maintain systems that generate synthetic data, perform supervised fine-tuning (SFT) and reinforcement learning. Experience with advanced alignment methods such as Direct Preference Optimization (DPO) and Proximal Policy Optimization (PPO) is a plus.
• Model compression and optimization: Apply quantization (e.g., GPTQ, SmoothQuant, AWQ), pruning, and distillation techniques to reduce model size and improve inference latency. Evaluate trade-offs between accuracy, throughput, and memory, collaborating with researchers on architectural choices.
• Scalable deployment: Design and deploy inference services using modern serving stacks such as vLLM, SGLang, and TensorRT-LLM. Leverage speculative decoding, model parallelism, and container orchestration for high-throughput, low-latency deployment.
• Cross-functional collaboration: Partner with researchers, product managers, and infrastructure engineers to bring new AI capabilities to market. Conduct code reviews, contribute to internal tooling and developer documentation, and ensure service reliability.
• Continuous improvement: Monitor performance metrics, identify bottlenecks in data pipelines or inference systems, and implement optimizations. Stay up-to-date with advances in LLMs, VLMs, diffusion models, and other multimodal architectures.
Requirements:
Minimum Qualifications
• Bachelor’s degree in Computer Science, Engineering, or a related technical field.
• 2+ years of experience in software or machine-learning engineering, including experience writing high-performance, production-quality code.
• Proficiency in Python and PyTorch; familiarity with systems languages such as Go, Rust, or C++ is beneficial.
• Experience building large-scale, fault-tolerant distributed systems and services.
• Strong understanding of transformer-based models and generative AI, including fine-tuning and RLHF workflows.
• Experience with model serving frameworks such as vLLM, SGLang, TensorRT-LLM, or similar inference engines.
• Excellent problem-solving skills and ability to communicate complex technical ideas to cross-functional teams.
Preferred Qualifications
• Master’s or Ph.D. in Computer Science, Electrical Engineering, or a related field.
• 5+ years of experience in software or ML engineering, with production-quality code delivery.
• Experience implementing RLHF, DPO, PPO, and evaluating alignment impact on model behavior.
• Hands-on experience with model compression techniques such as quantization, pruning/sparsity, and distillation.
• Familiarity with CUDA/Triton, GPU profiling tools, and distributed inference across multi-GPU clusters.
• Knowledge of ML compilers such as torch.compile, Triton, or XLA.
• Experience integrating multimodal AI systems (e.g., coding assistants, tool-calling agents, image/video/audio generation).
About Us:
Founded in 2009, IntelliPro is a global leader in talent acquisition and HR solutions. Our commitment to delivering unparalleled service to clients, fostering employee growth, and building enduring partnerships sets us apart. We continue leading global talent solutions with a dynamic presence in over 160 countries, including the USA, China, Canada, Singapore, Japan, Philippines, UK, India, Netherlands, and the EU.
IntelliPro, a global leader connecting individuals with rewarding employment opportunities, is dedicated to understanding your career aspirations. As an Equal Opportunity Employer, IntelliPro values diversity and does not discriminate based on race, color, religion, sex, sexual orientation, gender identity, national origin, age, genetic information, disability, or any other legally protected group status. Moreover, our Inclusivity Commitment emphasizes embracing candidates of all abilities and ensures that our hiring and interview processes accommodate the needs of all applicants. Learn more about our commitment to diversity and inclusivity at https://intelliprogroup.com/.