Machine Learning Engineer, Trust & Safety

In A Nutshell

Location

Hybrid: New York, NY or San Francisco, CA, USA

Salary

$340,000-$425,000

Job Type

Full-time

Experience Level

Mid-level

Visa Sponsorship

Available

Deadline to apply

February 21, 2025

Train models that detect harmful behaviors and help ensure user well-being, upholding Anthropic's principles of safety, transparency, and oversight while enforcing terms of service and acceptable use policies.

Responsibilities
• Build machine learning models to detect unwanted or anomalous behaviors from users and API partners, and integrate them into our production system.
• Improve our automated detection and enforcement systems as needed.
• Analyze user reports of inappropriate accounts and build machine learning models to detect similar instances proactively.
• Surface abuse patterns to our research teams to harden models at the training stage.
Skillset
• Have 4+ years of experience in a research/ML engineering or applied research scientist position, preferably with a focus on trust and safety.
• Have proficiency in SQL, Python, and data analysis/data mining tools.
• Have proficiency in building trust and safety AI/ML systems, such as behavioral classifiers or anomaly detection.
• Have strong communication skills and the ability to explain complex technical concepts to non-technical stakeholders.
• Care about the societal impacts and long-term implications of your work.
