What is Confident AI?
We’re building 1) an open-source package called DeepEval to unit-test LLM applications such as chatbots, agents, and RAG pipelines, and 2) the cloud platform for DeepEval.
It's like Next.JS and Vercel.
The founding team is a small group of exceptional engineers and researchers from top colleges and companies such as Google, Microsoft, and Princeton.
Our Values and Morals
Things we value:
- Earnest and hardworking, the most important trait.
- No excuses or BS—if something is wrong, surface it so someone can help.
- Openness and transparency—hiding a problem won’t make it go away.
- No politics, micromanagement, or bureaucracy, even in controversial discussions.
- Autonomy, ownership, and responsibility—just as expected from any grown adult.
- No ghosting—respect others’ time and effort.
- Doers, not yappers, function over form. This means we're ok with remote work as long as you deliver.
Job Description
This role is 50% engineering, 50% developer marketing (may vary slightly based on time of the month). It is important that you understand the growth role isn’t a traditional SWE role where writing code is the majority of your day to day.
What you'll be doing:\
- Working on DeepEval (one of the used package for LLM evaluation in the world) for both LLM evaluation features and also LLM red teaming features for DeepTeam.
- Write high quality content around what you've built in the form of documentation and blog articles for the open-source community.
- Be able to post and repurpose content on platforms such as reddit, Twitter, community articles, LinkedIn, and other relevant channels.
- Be able to define and measure growth, what it means for a growth channel to be successful, and find new ways to increase DeepEval/Team’s distribution.
- Be able to take charge of the open-source community in both discord and github.
- Support integrations with other open-source projects and form relationships with them
Support our open-source community for any questions and help they might need.
You should be someone who:
- Enjoys writing and posting content on developer marketing channels.
- Have a taste for good documentation, design, and constantly find ways to improve how our packages are presented to the world.
- Already is proficient in open-source contributions, your github profile should be as green as a carpet.
- Is a quick learner, you will be picking up on how to do SEO, GEO, and other existing strategies within the company.
- Enjoy reading papers, and have a natural curiosity for new research.
- Code extremely proficiently and quickly in Python and Typescript.
- Communicates well.
- Work 6 days a week, we're not hiding we expect a lot from you.
Your work will:
- Be used by hundreds of thousands of open-source users, all the way from individual hobbyist to AI leaders at Fortune 500 companies to companies such as OpenAI and Google.
- Educate hundreds of thousands of people, that wouldn't otherwise know how to quality assure their LLM applications.
- Help grow DeepEval into the Nextjs of evals
By joining us, you will:
- Be shaping the future of LLM testing and evaluation.
- Learn how to run and do startups, in a relatively safe environment.
- Work closely with the founders, with the possibly of promoted to an executive role in the future.
- Be compensated well, with generous founding equity. This also means that we expect a lot from you.