Job Overview
Job Title
AI Architect
Company
Creative Chaos
Location
Pakistan
Job Type
Full-Time
Experience
Senior
About This Role
Job Summary:
As an AI Architect you will build AI-native products. You’ll lead cross-functional Innovation Delivery Squads—owning outcomes end-to-end across web, mobile, AI agents, and streaming backends. You’re a hands-on technical leader who can scope, architect, staff, and ship; then run the product safely at scale.
Job Responsibilities:
- Stand up and run squads (Discovery → Prototype → Product → Platform & SRE).
- Design and ship RAG/agent systems: pick models (e.g., Anthropic Claude, OpenAI, Google, or open-weights like Llama/Mistral), define tools/functions, and choose retrieval (default Postgres + pgvector, scale to Weaviate/Qdrant/Pinecone when needed).
- Operate AI safely: evals & guardrails, structured outputs (JSON/Schema), PII redaction, refusal policies, cost/latency budgets, and LLM observability.
- Own delivery outcomes: SLOs, quality, cost, velocity; release with feature flags and canaries.
- Be client-facing: discovery, scoping, SoW, roadmap, QBRs.
- Hire/coach Tech Leads, EMs, and PMs; level up practices.
Requirements
- 8–12+ yrs engineering; 4+ yrs leading multi-team delivery; shipped production web/mobile systems at scale.
- Shipped at least one production AI app using Claude/GPT/Gemini/Llama/Mistral, backed by retrieval (pgvector or a vector DB) and a basic eval/guardrail pipeline.
- Implemented orchestration (LangGraph/DSPy or Temporal for durable workflows), rerankers (e.g., Cohere/Jina/Voyage), and prompt/tool versioning.
- Built with modern cloud + data: serverless/K8s, Terraform, OpenTelemetry, feature flags/experimentation.
- Excellent client communication and commercial sense (SoWs, staffing, utilization).
Tech stack (you have hands on experience)
- Models: Anthropic Claude; OpenAI; Google; open-weights (Llama, Mistral).
- Orchestration & agents: LangGraph (or DSPy) for graphs; Temporal for durable, long-running tasks and SLAs.
- Retrieval: Postgres + pgvector (default); Weaviate/Qdrant/Pinecone when scale/ops require; hybrid search with OpenSearch/Typesense.
- Embeddings / rerankers: OpenAI/Voyage/E5/BGE; Cohere/Jina/Voyage rerank.
- Guardrails & evals: JSON/Pydantic schemas, red-team sets, promptfoo/Ragas/DeepEval; content/PII filters.
- Observability: OpenTelemetry traces incl. prompt/tool spans; Langfuse/Arize Phoenix (or equivalent) + Sentry/Grafana.
- App & data: Next.js 15 (RSC), TypeScript/Go/Python; Postgres; Kafka/Redpanda/NATS; dbt/lakehouse optional.
- Ops: Cloud Run/ECS/K8s; Terraform/OpenTofu; GitHub Actions; LaunchDarkly/Unleash; Statsig/GrowthBook.
Originally posted on Himalayas
Why This Job Might Be a Good Fit
- Fully remote full-time position
- Senior data role at Creative Chaos
- Open to candidates in Pakistan
Similar Remote Jobs
Dell Technologies
More Remote Jobs by Location
More Remote Data Jobs
Get Daily Remote Job Alerts Before Others Do
Join 12,000+ remote professionals
No spam, unsubscribe anytime. We respect your privacy.
Frequently Asked Questions
Is this position fully remote?
Yes, this role is listed as a remote position. You can work from anywhere within the specified location requirements.
How do I apply for this job?
Click the "Apply on Company Website" button to be redirected to the official application page.
Are international applicants welcome?
Check the location requirements listed above. Some positions are restricted to specific regions.
When was this job posted?
The posting date is shown in the Quick Facts sidebar. We update our listings daily to ensure accuracy.
About Creative Chaos
Creative Chaos