Reading Group (+๐Ÿง‹): Agents' Last Exam
Education

Reading Group (+๐Ÿง‹): Agents' Last Exam

Tue, Jun 30
01:00 AM โ€“ 03:30 AM
101 Second StreetFree ยท See website
About the event

Join the Snorkel AI Reading Group, a thriving community platform designed for delving into cutting-edge developments in AI. Connect with fellow enthusiasts and broaden your knowledge!

In this special afternoon session, Yiyou Sun and Xinyang Han, Postdoctoral Researchers at UC Berkeley, will present their recent paper titled: Agents' Last Exam.

Agenda:

  • 4 PM - Doors open
  • 4:30 PM - Talk begins
  • ๐Ÿง‹ Enjoy Boba tea and other refreshments!

Key Takeaways:

  • Discover ALE, a benchmark crafted to assess AI agents on long-term, economically valuable tasks with verifiable outcomes, developed with insights from over 250 industry experts and encompassing 1,000+ tasks across 55 subfields and 13 industry clusters.
  • Explore the limitations of current benchmarks, which often fail to sustain performance measurement in real-world workflows, creating a significant divide between benchmark success and actual deployment in professional environments.
  • Learn how ALE aligns task coverage with the O*NET / SOC 2018framework, ensuring systematic and reproducible coverage of various non-physical job categories.
  • Understand the challenge of the hardest task tier, with an average pass rate of just2.6%, revealing the significant potential for improvement.
  • See how ALE's task pool is perpetually expanding with new workflows and industries, providing a dynamic approach to tracking agent capabilities over time.
  • Recognize that ALE aims to bridge the gap between benchmark performance and its real-world economic impact!

This insightful session promises to enrich your understanding of AI capabilities and its economic implications. Don't miss out!

Location

101 Second Street

Get directions

This week in San Francisco

More events in San Francisco

See website