AI Evaluation Data Scientist - Health, Mid Level Job at Jobright.ai, Cupertino, CA

WmoxZTgzZ1kwWXI4UWUwZXVpNG1IQnFUcGc9PQ==
  • Jobright.ai
  • Cupertino, CA

Job Description

Jobright is an AI-powered career platform that helps job seekers discover the top opportunities in the US. We are NOT a staffing agency. Jobright does not hire directly for these positions. We connect you with verified openings from employers you can trust.

Job Summary:

Apple is a leading technology company focused on health technologies that support users in living healthier lives. The AI Evaluation Data Scientist in the Health team will develop and validate evaluation methodologies for Generative AI systems, design human annotation frameworks, and conduct statistical analyses to enhance the quality of health products.

Responsibilities:

• Design and analyze human evaluations of AI systems to create reliable annotation frameworks, and ensure validity and reliability of measurements of latent constructs

• Develop and refine benchmarks and evaluation protocols, using statistical modeling, test theory, and task design to capture model performance across diverse contexts and user needs

• Conduct statistical analysis of evaluation data to extract meaningful insights, identify systematic issues, and inform improvements to both models and evaluation processes

• Analyze model behavior, identify weaknesses, and drive design decisions with failure analysis. Examples include, but not limited to: model experimentation, adversarial testing, counterfactual analysis, creating tools to assess model behavior and user impact

• Collaborate with engineers to translate evaluation methods and analysis techniques into scalable, adaptable, and reliable solutions that can be reused across different features, use cases, and evaluation workflows

• Work cross-functionally to apply methods to real-world applications with designers, clinical experts, and engineering teams across Hardware and Software

• Independently run and analyze experiments for real improvements

Qualifications:

Required:

• Bachelor's degree (or equivalent experience) in a empirical field with emphasis on quantitative methodologies of human behavior, including HCI, Psychometrics, Quantitative or Experimental Psychology, Educational Measurement, Language Assessment, or a relevant field

• Proficiency in Python and ability to write clean, performant code and collaborate using standard software development practices (e.g. Git)

• Strong statistical analysis skills and experience in crafting experiments, validating data quality and model performance

• Experience in building and extending data and inference pipelines to process large scale datasets

Preferred:

• MS and a minimum of 3 years of relevant industry experience or PhD in relevant fields

• Real-world experience with LLM-based evaluation systems and human annotation and human evaluation methodologies

• Experience in rigorous, evidence-based approaches to test development, e.g. quantitative and qualitative test design, reliability and validity analysis

• Customer-focused mindset with experience or strong interest in building consumer digital health and wellness products

• Strong communication skills and ability to work cross-functionally with technical and non-technical stakeholders

Company:

Apple is a technology company that designs, manufactures, and markets consumer electronics, personal computers, and software. Founded in 1976, headquartered in Cupertino, California, USA, team size 10001+ employees, currently Public Company. Apple has a track record of offering H1B sponsorships.

Job Tags

H1b,

Similar Jobs

Your Home Sold Guaranteed Realty - Coldwell Real Estate Serv...

Real Estate Sales Buyer's Agent Job at Your Home Sold Guaranteed Realty - Coldwell Real Estate Serv...

 ...to do the fun part over and over.Responsibilities: Provide potential home buyers with pertinent information about their local housing market Coordinate efforts to negotiate property sale between buyer and seller or listing agent to achieve desired results Arrange... 

TEL Staffing & HR

Dump Truck Driver - Manual CDL Job at TEL Staffing & HR

 ...Now hiring for 2 Full Time CDL Truck Drivers - MANUAL TRANSMISSION (Dump-Trucks, Roll-off Trucks, and/or Hook Trucks) Safely operate Dump-Truck, Hook Truck, and/or Roll-off Truck to transport and deliver roll-off containers following a daily schedule Perform pre... 

Alyeska Resort

Lead Host, Seven Glaciers Job at Alyeska Resort

 ...and ensuring seamless operations?If you thrive in a fast-paced restaurant environmentand want to be part of a team that values quality...  ...within the introductory period, the following:a TAPS card & a Food Handlers Card. What to Expect &##128204;Be on your feet... 

Coach

Lead Supervisor Job at Coach

 ...Supervisor in Clinton, CT, to drive sales and lead a team in achieving retail excellence. The role involves strategic sales planning, team...  ...a brand-aligned store environment. Candidates should have 3-5 years of retail management experience, preferably in luxury fashion.

Gpac

Food Safety and Quality Assurance Manager Job at Gpac

Gpac is currently working with a food manufacturer in your area who is seeking an experienced Food Safety and Quality Assurance Manager . This is a unique opportunity to lead the development and implementation of a comprehensive food safety and quality management system...