Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
AI Research Scientist - Evaluation, Handshake AI image - Rise Careers
Job details

AI Research Scientist - Evaluation, Handshake AI

Your impact

Handshake is building the future of human data for AI.

We partner directly with top AI labs to power large language model (LLM) training and evaluation with high-quality, expert-generated data. As AI models become more sophisticated, the demand for specialized human input continues to grow—and Handshake is uniquely positioned to meet it. We power career platforms at 92% of the top U.S. universities, giving us direct access to verified expert talent across a wide range of domains.

Our AI team is rapidly building a new generation of human data products—from expert annotation platforms to AI interviewers and seamless payout infrastructure—all designed to accelerate research and improve model performance.

We’ve assembled a world-class team from YC, Notion, Scale, Coinbase, Palantir, and more, and we’re working directly with many of the world’s leading AI research labs. This is a unique opportunity to join a fast-growing team shaping the future of AI through better data, better tools, and better systems—for experts, by experts.

We’re seeking Research Scientists to join our Handshake AI Research team, where you’ll conduct pioneering research that pushes the boundaries of LLM understanding and evaluation. You’ll work at the frontiers of model comprehension, advanced evaluation methodologies, and the intersection of human and AI knowledge systems, with the goal of building the future of how we understand and measure AI capabilities.

As a Research Scientist, you’ll collaborate closely with engineers and cross-functional teams to transform fundamental insights into breakthrough evaluation frameworks and understanding paradigms. Whether you’re developing novel approaches to probe model understanding, creating sophisticated benchmarks that reveal emergent capabilities, or establishing new methodologies for measuring AI-human knowledge alignment; you’ll help define how we comprehend and assess the next generation of AI systems.

Location: San Francisco or New York City

Your role

  • Design and conduct original research in LLM understanding, evaluation methodologies, and the dynamics of human-AI knowledge interaction

  • Develop novel evaluation frameworks and assessment techniques that reveal deep insights into model capabilities and limitations

  • Collaborate with engineers to transform research breakthroughs into scalable benchmarks and evaluation systems

  • Pioneer new approaches to measuring model understanding, reasoning capabilities, and alignment with human knowledge

  • Write high-quality code to support large-scale experimentation, evaluation, and knowledge assessment workflows

  • Publish findings in top-tier conferences and contribute to advancing the field’s understanding of AI capabilities

  • Work with cross-functional teams to establish new standards for responsible AI evaluation and knowledge alignment

Your experience

  • PhD or equivalent research experience in machine learning, computer science, cognitive science, or a related field with focus on AI evaluation or understanding

  • Strong background in LLM research, model evaluation methodologies, interpretability, or foundational AI assessment techniques

  • Demonstrated ability to independently lead post training and evaluation research projects from theoretical framework to empirical validation

  • Proficiency in Python and deep experience with PyTorch for large-scale model analysis and evaluation

  • Experience designing and conducting experiments with large language models, benchmark development, or systematic model assessment

  • Strong publication record in post training, AI evaluation, model understanding, interpretability, or related areas that advance our comprehension of AI capabilities

  • Ability to clearly communicate complex insights about model behavior, evaluation methodologies, and their implications for AI development

Nice to Have

  • Experience with RL, agent modeling, or AI alignment

  • Familiarity with data-centric AI approaches, synthetic data generation, or human-in-the-loop systems

  • Understanding of the challenges in scaling foundation models (e.g., training stability, safety, inference efficiency)

  • Contributions to open-source AI libraries or research tooling

  • Interest in shaping the societal impact, deployment ethics, and governance of frontier models

Why Join Us

  • This is a rare opportunity to help define how the world’s top labs build, test, and evaluate cutting-edge AI systems. You’ll be working with a uniquely high-talent team, tapping into a network of 18 million students and 500K+ PhDs, and shaping foundational infrastructure at a critical moment in the field. If you're excited to build from first principles—and want your work to directly accelerate frontier AI—we'd love to talk.

What we offer

At Handshake, we'll give you the tools to feel healthy, happy and secure.

Benefits below apply to US employees in full-time positions.

  • 💰 Equity and ownership in a fast-growing company.

  • 🍼 16 Weeks of paid parental leave for birth giving parents & 10 weeks of paid parental leave for non-birth giving parents.

  • 💝 Comprehensive medical, dental, and vision policies including LGTBQ+ Coverage. We also provide resources for Mental Health Assistance, Employee Assistance Programs and counseling support.

  • 📚 Generous learning & development opportunities and an annual $2,000 stipend for you to grow your skills and career.

  • 💰 Financial coaching through Origin to help you through your financial journey.

  • 🛜 Monthly internet stipend and a brand new MacBook to allow you to do your best work.

  • 🚃 Monthly commuter stipend for you to expense your travel to the office (for office-based employees).

  • 🥗 Free lunch provided 5x a week in office.

  • 🏋️ Free gym access in San Francisco office building.

  • 🤝 Referral bonus to reward you when you bring great talent to Handshake.

  • 🧗🏼Team outings throughout the year to stay connected to each other.

  • 🏦 401k Match: Handshake offers a dollar-for-dollar match on 1% of deferred salary, up to a maximum of $1,200 per year.

  • 🏝 All full-time US-based Handshakers are eligible for our flexible time off policy to get out and see the world. In addition, we offer 13 standardized holidays, and 2 additional days of flexible holiday time off. Lastly, we have a Winter #ShakeBreak, a one-week period of Collective Time Off.

  • 💻 Handshake offers $500 home office stipend for you to spend during your first 3 months to create a productive and comfortable workspace at home.

  • 🍼 Family support: Parental leave coaching and support provided by Parentaly. We partner with Maven Clinic to provide a lifetime coverage up to $15K for expenses related to fertility and family forming!

  • 💰 Lifestyle Savings Account: We offer you an annual stipend of $500 to use for purchases such as fitness classes, gym memberships, work-from-home setup, and more.

Looking for more? Explore our mission, values and comprehensive US benefits at joinhandshake.com/careers.

Handshake is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or reasonable accommodation, please let your recruiter know during initial communications.

Handshake Glassdoor Company Review
4.1 Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon
Handshake DE&I Review
3.9 Glassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star icon
CEO of Handshake
Handshake CEO photo
Garrett Lord
Approve of CEO

Average salary estimate

$150000 / YEARLY (est.)
min
max
$120000K
$180000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Photo of the Rise User
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Vision Insurance
Sabbatical
Paid Holidays

Contribute your geochemistry expertise remotely as an AI Tutor at Handshake AI, enhancing the scientific accuracy of advanced AI models.

Photo of the Rise User
Posted 7 hours ago
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Vision Insurance
Sabbatical
Paid Holidays

Energize customer success for SMB clients in a fast-paced startup by joining as a Scaled Customer Success Manager based in San Francisco.

Photo of the Rise User
Posted 2 days ago

AbbVie seeks a Senior Scientist I specializing in siRNA biology to lead pioneering research in RNA therapeutics.

Photo of the Rise User

An entry-level environmental scientist or geologist role focused on field investigations and data management within an employee-owned environmental consulting firm.

Photo of the Rise User
SGS Hybrid 2235 US-130, Dayton, NJ 08810, USA
Posted 12 days ago

Join SGS as a Lab Analyst I to deliver expert analytical services in a dynamic laboratory environment.

Photo of the Rise User
Posted yesterday

Eurofins seeks a Bioprocess Analytical Chemist to perform analytical tests and data analysis supporting robust cell line development for protein therapeutics.

Posted 8 days ago

Clinical Data Manager needed to lead data management activities and ensure data quality for clinical trials at Katalyst Healthcares & Life Sciences.

Photo of the Rise User
Posted 11 days ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Take Risks
Collaboration over Competition
Growth & Learning
Transparent & Candid
Customer-Centric
Social Impact Driven
Rapid Growth
Passion for Exploration
Dare to be Different
Reward & Recognition
Friends Outside of Work
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Conferences Stipend
Bias Training
Employee Resource Groups
401K Matching
Paternity Leave
Maternity Leave
Some Meals Provided
Social Gatherings

Join Google's Health AI team as a Research Software Engineer to enhance AI evaluations in health applications.

Photo of the Rise User
California ISO Hybrid 250 Outcropping Way, Folsom, CA 95630, USA
Posted 2 days ago

California ISO is looking for a detail-oriented Market Validation and Quality Analyst to support the reliability and fairness of California’s wholesale electricity market through rigorous market and power system analysis in a hybrid work setting.

Photo of the Rise User
Posted 2 days ago

Eurofins Scientific seeks an experienced Analytical Chemist for onsite R&D analytical testing and method development in Boston, MA.

Photo of the Rise User

Join AbbVie as a Principal Embedded Firmware Device Engineer and contribute to the development of cutting-edge combination products that impact health outcomes.

Photo of the Rise User
Streamline Defense Hybrid Quantico Base, Virginia, United States
Posted 8 days ago

FMV Analyst role at Streamline offers an opportunity to lead real-time GEOINT analysis and deliver critical intelligence products for Marine Corps missions at Quantico.

Photo of the Rise User
Posted 10 days ago

Join AbbVie as a Scientist II to lead innovative research in immunology and peptide therapeutics within a collaborative environment.

Posted 11 days ago

Step into a pivotal role as a Scientist III at Katalyst Healthcares, where you'll contribute to groundbreaking RNA therapeutic research.

Photo of the Rise User
Posted 6 days ago

Innovate high-impact physics simulation software as a Computational Physicist at Lawrence Livermore National Laboratory to advance U.S. national security.

Our mission at Handshake is to give all students the chance to build the career they want, no matter where they’re from or what school they attend.

105 jobs
MATCH
Calculating your matching score...
BENEFITS & PERKS
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Vision Insurance
Sabbatical
Paid Holidays
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
June 27, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!