Software Engineer- BIS (Baseten Inference Stack)
Baseten
- Dept
- Software Engineering
- Location
- San Francisco
- Comp
- -
Search 36,108 jobs across 1,642 companies, from pre-seed agent labs to frontier infrastructure teams.
1,642 companies · showing 12
AI safety company building reliable, interpretable, and steerable AI systems, creator of Claude.
Full-stack AI-native insurance carrier that automates underwriting, policy management, and claims for startups and commercial businesses.
Geordie AI builds an AI governance platform to help enterprises adopt agentic AI safely with observability, compliance controls, and risk intelligence.
Cognition is an AI agent lab building software engineering agents, including Devin, to help engineering teams plan, implement, and debug software more autonomously.
Pace is an AI-native business process outsourcer (BPO) for the world's leading insurers, combining agentic AI for document processing, web automation, and phone calls with human review to automate mission-critical insurance operations. The platform handles submission intake, FNOL, policy servicing, claims handling, and data entry tasks that traditionally relied on offshore labor. Backed by Sequoia Capital, Pace counts Prudential, The Mutual Group, and Newfront among its customers.
Semantic KV caching layer built for LLM inference, enabling AI applications to reduce inference costs and latency by reusing cached computation across similar prompts.
Flexprice is an open-source monetization infrastructure platform built for AI-native and SaaS companies. It enables teams to operate usage-based, credit-based, and hybrid pricing models with real-time metering and reporting, following an open-core model with self-hosted or cloud deployment options.
OpenRouter provides developers with seamless access to 500+ AI models through a single unified API, eliminating the need to rewrite code or renegotiate contracts for LLM integration.
Polsia is an autonomous multi-agent AI platform that plans, codes, markets, and manages businesses end-to-end. It deploys specialized AI agents covering orchestration, social media, email outreach, customer support, ads, finance, business planning, competitor research, and code generation. Customers pay $49/month plus a 20% revenue share on platform-generated economic activity.
Serverless cloud platform for developing, deploying, and scaling AI applications. Developers can run any code in the cloud without managing infrastructure, with built-in support for GPU workloads and batch jobs.
Modern procurement platform providing Source-to-Pay solutions that help companies keep spending under control while enhancing their procurement teams through AI-powered tools.
Catena Labs builds AI-native financial infrastructure spanning agent identity, stablecoins, and banking rails so AI agents can participate in the economy safely.
36,108 jobs available · showing 20
Baseten
CodeRabbit
CodeRabbit
Harper
Heidi Health
Heidi Health
Hippocratic AI
Listen Labs
Listen Labs
Modal
Modal
Modal
Nudge
OpenAI
OpenAI
OpenAI
OpenAI
OpenAI
OpenAI
Phylo