Anthropic is an AI safety and research company that’s working to build reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our customers and for society as a whole. Our interdisciplinary team has experience across ML, physics, policy, business and product.
Responsibilities:
- Lead a team of engineers building systems to detect and prevent harm and abuse using Anthropic's AI services
- Implement systems to detect fraudulent accounts, spam campaigns, harmful user generated content, and other malicious usage
- Analyze usage patterns and develop protections against new methods of attack and evasion
- Work closely with data scientists to develop algorithms and signals for detecting threats
- Build self-service tools for customers to monitor and control access to AI services
- Design our process for responding to detected signals, including communicating threats and remedies across the organization
- Coach and mentor team members in their career growth
You may be a good fit if you:
- Have 5+ years in an engineering management role, leading teams building integrity, trust and safety, or anti-fraud/abuse systems
- Have deep experience with techniques for bot detection, account fraud, misinformation, and/or harmful user-generated content
- Have the ability to balance speed and precision when responding to attacks and evaluating risk
- Have excellent communication skills to explain threats and tradeoffs to stakeholders
- Have people management skills in coaching, recruiting, and developing engineers
- Have experience designing operational processes around on-call, post-mortems, etc.
Strong candidates may also:
- Have a background in building systems at scale with a focus on reliability and performance
- Have experience with AI/ML and understanding how models can be manipulated
- Have knowledge of common internet communities, and adversaries like spammers, fraud rings, and their evolving techniques
- Use technical depth to assess and improve system designs
- Have project management skills to balance priority tradeoffs
Annual Salary:
The expected salary range for this position is $300k - $500k.
Compensation and Benefits:
Anthropic’s compensation package consists of three elements: salary, equity, and benefits. We are committed to pay fairness and aim for these three elements collectively to be highly competitive with market rates.
Equity - On top of this position's salary (listed above), equity will be a major component of the total compensation. We aim to offer higher-than-average equity compensation for a company of our size, and communicate equity amounts at the time of offer issuance.
US Benefits:
- Optional equity donation matching at a 3:1 ratio, up to 50% of your equity grant.
- Comprehensive health, dental, and vision insurance for you and all your dependents.
- 401(k) plan with 4% matching.
- 21 weeks of paid parental leave.
- Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more!
- Stipends for education, home office improvements, commuting, and wellness.
- Fertility benefits via Carrot.
- Daily lunches and snacks in our office.
- Relocation support for those moving to the Bay Area.
UK Benefits:
- Optional equity donation matching at a 3:1 ratio, up to 50% of your equity grant.
- Private health, dental, and vision insurance for you and your dependents.
- Pension contribution (matching 4% of your salary).
- 21 weeks of paid parental leave.
- Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more!
- Health cash plan.
- Life insurance and income protection.
- Daily lunches and snacks in our office.
#J-18808-Ljbffr