Description: Our client is currently seeking a Project Manager II
Overall Responsibilities:
Lead structured pre-launch safety, neutrality, and fairness testing, end to end, for GenAI products. For each launch, this involves defining the standards applicable to the product, defining and executing prompt generation strategies, collaborating with product teams to scrape responses, working with our extended workforce to rate prompts and responses against the defined standards (including providing clear instructions, clarifying gray-area cases, and/or providing quality calibrations), and conducting in-depth quantitative and qualitative analysis of the results.
Top 3 Daily Responsibilities:
Intake and triage new launch submissions; understand requirements and either kick off the engagement or schedule it for a future start date.
Move existing launches through the pre-launch testing process, using Buganizer bugs to track progress. This may include conducting work independently or collaborating with stakeholders to gather information or ensure they are taking action on their end. Key steps will include:
Aligning on the safety, neutrality, and fairness standards applicable to the product, with an eye toward consistency across product areas, and translating those standards into clear guidelines for evaluating whether the product's output is compliant.
Defining and executing prompt generation strategies to develop a set of prompts that will sufficiently test product compliance with standards. This may entail leveraging LLM-based prompt generation tools and/or defining and providing clear instructions to vendor teams.
Collaborating with product teams to scrape responses. This may entail providing consultation for how to develop a scaled scraping solution (UI, API, etc.), getting access to the model/UI and performing scrapes, and/or defining and providing clear instructions to vendor teams.
Executing prompt/response rating against defined standards. This may entail providing clear instructions to vendor teams, clarifying gray area cases, and/or providing quality calibrations.
Conducting deep-dive analysis: in-depth quantitative and qualitative analysis of results, including unexpected, interesting, and edge cases, with clear and actionable insights to inform decisions on pre- and post-launch mitigation steps.
Mandatory Skills/Qualifications:
Bachelor's degree or equivalent practical experience.
4 years of experience in any one of data analytics, Trust & Safety, policy, cybersecurity, or related fields.
Experience using data to provide solutions and recommendations.
Excellent communication and presentation skills (written and verbal) and the ability to influence cross-functionally at various levels.
This role may be exposed to graphic, controversial, and/or upsetting content.
Non-Essential Skills/Qualifications:
2+ years of experience in trust and safety, product policy, privacy and security, legal, compliance, risk management, intel, content moderation, red teaming, AI testing, adversarial testing, or similar.
1+ years of experience in business process analysis, operations management, and/or global program management, or in leading cross-functional process improvements.
Strong understanding of AI systems, machine learning, and their potential risks.
Ability to think strategically and identify emerging threats and vulnerabilities.
Excellent problem-solving and critical thinking skills with attention to detail in an ever-changing environment.
Proven ability to work independently and as part of a team.
Contact: apandey08@judge.com
This job and many more are available through The Judge Group. Find us on the web at www.judge.com