We are looking for a Senior Python / GenAI Engineer with strong experience building cloud-native applications and developing solutions powered by large language models. In this role, you will help design and implement scalable AI-driven services within a modern cloud environment, working closely with engineering teams to build APIs, optimize inference pipelines, and deliver production-ready GenAI applications.
Full Desk, Direct Hire role.
Hybrid opportunity in the Houston area.
Primarily a backend engineering role focused on AI platform development.
What you will be doing
Tech Breakdown
70% new development
30% optimization, support, and enhancements
Daily Responsibilities
• Design and implement scalable APIs and shared software services using Python
• Build and support applications on an AI platform leveraging modern GenAI technologies
• Develop LLM-powered applications using tools like LangChain or LlamaIndex
• Work with Azure OpenAI and/or Google GenAI APIs to integrate LLM capabilities into production systems
• Build cloud-native applications in Azure and GCP
• Implement serverless architectures and containerized workloads using Docker and Kubernetes
• Design, monitor, and optimize AI inference and data pipelines to improve performance and efficiency
• Troubleshoot issues across application code, cloud environments, and AI services
• Document architecture, APIs, and internal AI platform components
• Collaborate with internal engineering and support teams to deliver scalable AI-driven solutions
• Contribute to architecture discussions and continuously improve system performance, security, and scalability
• Develop proof-of-concepts (POCs) to evaluate emerging AI technologies and their potential business impact
Experience / Requirements
• 7+ years of full software development lifecycle experience including design, development, testing, deployment, and support
• Strong Python development experience
• Deep understanding of object-oriented design principles, design patterns, and scalable architecture
• Proven experience designing and implementing APIs and backend services
• Hands-on experience building and deploying applications in Azure and/or Google Cloud Platform
• Strong understanding of cloud scaling concepts and distributed systems
• 2+ years of experience building LLM-based applications
• Experience implementing RAG architectures, embeddings, vector databases, agents, and fine-tuning
• Experience with LLM orchestration frameworks such as LangChain or LlamaIndex
• Hands-on experience integrating OpenAI / Azure OpenAI / Google GenAI APIs
• Experience with containerization and orchestration (Docker, Kubernetes)
• Familiarity with version control tools such as Git / Azure DevOps
• Experience writing Azure DevOps YAML pipelines for CI/CD
• Experience with at least one additional object-oriented language such as Java, C#, or C++
• Strong communication skills and ability to work both independently and within collaborative engineering teams
Nice to Have
• AWS cloud experience
• Experience with Databricks or Apache Spark
• Knowledge of data warehousing and data lake architectures
• Experience working with relational databases and SQL
• Terraform infrastructure-as-code experience
• Experience building AI platform components or internal developer platforms
Education
Bachelor’s degree in Computer Science, Information Systems, or a related technical field