Special Notice:
This position is NOT contingent upon awarding of a project or needing a funding source. This is full-time employment with webAI.
About the Role:
We are seeking a Senior Machine Learning Engineer to support our Public Sector initiatives focused on building and optimizing production ready AI systems for secure and distributed environments.
You will be responsible for transforming prototype models into scalable, efficient, and reliable production systems that operate seamlessly across a spectrum of hardware from government cloud infrastructure to edge devices in restricted or disconnected environments.
Responsibilities:
Design, develop, and deploy agentic workflows to orchestrate multi-step reasoning, tool use, and decision-making across production systems.
Productionize AI models from research prototypes into scalable, deployable systems used in real world applications.
Engineer adaptive ML systems using LoRA, PEFT, and on-device inference strategies, leveraging PyTorch, TensorFlow, and Hugging Face Transformers for model development, fine-tuning, and optimization.
Implement model optimization techniques such as quantization, pruning, distillation, and hardware specific acceleration.
Build and maintain Retrieval Augmented Generation (RAG) pipelines, including vector database integration for contextual retrieval.
Work with multi-modal AI systems across computer vision, audio, and natural language domains.
Optimize model execution for distributed and resource constrained environments, ensuring reliability under variable connectivity conditions.
Qualifications:
Active US Security clearance
4+ years of experience in applied AI, ML engineering, or production AI systems.
Deep proficiency in PyTorch, TensorFlow, or Hugging Face Transformers.
Proven experience deploying AI models across cloud, edge, and mobile hardware environments.
Expertise in model compression and optimization (quantization, pruning, distillation).
Experience building RAG pipelines and integrating vector databases (e.g., Quadrant, ChromaDB, FAISS, Milvus, Pinecone).
Familiarity with multi-modal models and synthetic data generation methods.
Strong algorithmic and problem solving skills, especially in distributed or constrained compute environments.
Preferred Skills:
Experience with edge AI, federated learning, or offline inference systems.
Understanding of AI governance and compliance frameworks relevant to public sector deployments.
Experience integrating models into large scale distributed systems or microservice architectures.
Excellent communication and technical documentation skills for collaboration across multi disciplinary teams.
Strong understanding of GPU computing, CUDA, and performance profiling.
We at webAI are committed to living out the core values we have put in place as the foundation on which we operate as a team. We seek individuals who exemplify the following:
Truth - Emphasizing transparency and honesty in every interaction and decision.
Ownership - Taking full responsibility for one’s actions and decisions, demonstrating commitment to the success of our clients.
Tenacity - Persisting in the face of challenges and setbacks, continually striving for excellence and improvement.
Humility - Maintaining a respectful and learning-oriented mindset, acknowledging the strengths and contributions of others.
Benefits:
Competitive salary
Comprehensive health, dental, and vision benefits package
401(k) match (U.S.-based employees only)
$200/month Health & Wellness stipend
Continuing Education support
$500/year Function Health subscription (U.S.-based employees only)
Free parking for in-office employees
Flexible Time Off (FTO)
Parental leave for eligible employees
Supplemental life insurance
webAI is an Equal Opportunity Employer and does not discriminate against any employee or applicant on the basis of age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We adhere to these principles in all aspects of employment, including recruitment, hiring, training, compensation, promotion, benefits, social and recreational programs, and discipline. In addition, it is the policy of webAI to provide reasonable accommodation to qualified employees who have protected disabilities to the extent required by applicable laws, regulations and ordinances where a particular employee works.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
dMetrics seeks a Senior Backend Engineer to design and implement high‑throughput Java services and data pipelines that power a production ML/NLP analytics platform.
CDW is hiring a remote Software Engineer I (Backend) to build and maintain Flask-based REST and GraphQL APIs on AWS while ensuring quality, performance, and secure production operations.
Winsupply seeks an Intermediate Full-Stack Java Developer at its Moraine support campus to design, build, and maintain scalable RESTful services and integrations from design through production.
Benepass is hiring a Senior Design Engineer to design, build, and evolve a scalable React/TypeScript design system and component library that bridges design and engineering.
Experienced MuleSoft engineer needed to design and deliver Mule 4 APIs and integrations on Anypoint Platform to support enterprise connectivity and scalable production integrations for AXS.
U-Haul Mobile is hiring an iOS Developer Intern to work with Swift and Xcode on customer-facing and internal apps, gaining hands-on experience across the full mobile development lifecycle.
Experienced software engineer needed to build and maintain cloud-based, customer-facing legal software using Java, JavaScript frameworks (e.g., Angular), and AWS in a hybrid Agile team environment.
At Hinge Health, you will build scalable geospatial search and entity resolution systems that help millions quickly find the right MSK care across providers and locations.
Ivo seeks a Backend Software Engineer to build scalable pipelines and search systems that analyze millions of contracts using LLM orchestration and advanced clustering.
Tenex is hiring a Software Engineer II to develop scalable full-stack systems for its AI-native MDR platform and help shape product and engineering practices in a fast-growing startup.
GoodLeap is hiring a Senior Full-Stack Software Engineer/Tech Lead to drive frontend-focused, full-stack initiatives and build scalable, AI-enabled finance platform features while mentoring teammates.
Lead the engineering organization for Skylar AI—shaping platform architecture, teams, and processes to deliver scalable, responsible AI for enterprise IT operations.
Own the core platform at Alpaca Health — design the domain model, enforce money and state invariants, and build reliable LLM-powered clinical workflows for a fast-growing healthcare startup.