Full-time
Remote
India
AI Developer

Req number:

R5796

Employment type:

Full-time

Worksite flexibility:

Remote

Who we are

CAI is a global technology services firm with over 8,500 associates worldwide and a yearly revenue of $1 billion+. We have over 40 years of excellence in uniting talent and technology to power the possible for our clients, colleagues, and communities. As a privately held company, we have the freedom and focus to do what is right—whatever it takes. Our tailor-made solutions create lasting results across the public and commercial sectors, and we are trailblazers in bringing neurodiversity to the enterprise.

Job Summary

As an AI Developer, you will specialize in agentic AI frameworks (LangChain, LangGraph, CrewAI, or equivalents) and take both vision and language models from prototype to production. You will lead the design of multi-agent systems that coordinate perception (image classification and extraction), reasoning, and action, while owning the end-to-end deep learning life cycle (training, scaling, deployment, and monitoring).

Job Description

We’re seeking an AI Developer who specializes in agentic AI frameworks (LangChain, LangGraph, CrewAI, or equivalents) and who can take both vision and language models from prototype to production. You will lead the design of multi-agent systems that coordinate perception (image classification and extraction), reasoning, and action, while owning the end-to-end deep learning life cycle (training, scaling, deployment, and monitoring). This is a full-time, remote position.

What You’ll Do

  • Agentic AI Frameworks (Primary Focus): Architect and implement multi‑agent workflows using LangChain, LangGraph, CrewAI, or similar.
    Design role hierarchies, state graphs, and tool integrations that enable autonomous data processing, decision‑making, and orchestration.
    Benchmark and optimize agent performance (cost, latency, reliability).
  • Image Classification & Extraction: Build and fine‑tune CNN/ViT models for classification, detection, OCR, and structured data extraction.
    Create scalable data‑ingestion, labelling, and augmentation pipelines.
  • LLM Fine‑Tuning & Retrieval‑Augmented Generation (RAG): Fine‑tune open‑weight LLMs with PEFT methods such as LoRA/QLoRA; perform SFT, DPO, or RLHF as needed.
    Implement RAG pipelines using vector databases (FAISS, Weaviate, pgvector) and domain‑specific adapters.
  • Deep Learning at Scale: Develop reproducible training workflows in PyTorch/TensorFlow with experiment tracking (MLflow, W&B).
    Serve models via TorchServe/Triton/KServe on Kubernetes, SageMaker, or GCP Vertex AI.
  • MLOps & Production Excellence: Build robust APIs/micro‑services (FastAPI, gRPC).
    Establish CI/CD, monitoring (Prometheus, Grafana), and automated retraining triggers.
    Optimize inference on CPU/GPU/Edge with ONNX/TensorRT, quantization, and pruning.
  • Collaboration & Mentorship: Translate product requirements into scalable AI services.
    Mentor junior engineers, conduct code and experiment reviews, and evangelize best practices.

What You'll Need

  • B.S./M.S. in Computer Science, Electrical Engineering, Applied Math, or related discipline.
  • 5+ years building production ML/DL systems with strong Python & Git.
  • Demonstrable expertise in at least one agentic AI framework (LangChain, LangGraph, CrewAI, or comparable).
  • Proven delivery of computer‑vision models for image classification/extraction.
  • Hands‑on experience fine‑tuning LLMs and deploying RAG solutions.
  • Solid understanding of containerization (Docker) and cloud AI stacks (AWS/Azure).
  • Knowledge of distributed training, GPU acceleration, and performance optimization.

Physical Demands

  • This role involves mostly sedentary work, with occasional movement around the office to attend meetings, etc.
  • Ability to perform repetitive tasks on a computer, using a mouse, keyboard, and monitor.

Reasonable accommodation statement

If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employment selection process, please direct your inquiries to application.accommodations@cai.io or (888) 824-8111.