
The Lytic Threshold
Join us for an insightful discussion on advanced topics in language model behavior!
In this engaging session, Sheikh Abdur Raheem Ali will delve into:
1️⃣ Self-Directed Behavior: Examination of rare instances where language models have shown self-directed actions that could result in severe consequences if powerful agents managed to escape control.
2️⃣ Peptide-Based Communication: Introduction to arbitrium, a method employed by bacteriophages, utilizing autoinducers for coordinating behavior at the population level, focusing on lytic and lysogenic pathways through quorum sensing.
3️⃣ Alignment Science Insights: Discussion on groundbreaking findings that enhance our grasp of LLM biology – notably comparing malicious document requirements to virulence in humans.
4️⃣ Defensive Mechanisms: Analysis of experiment results on scalable improvements for monitoring activations in transformer models, highlighting how these innovations support targeted interventions in complex scenarios.
Event Schedule:
🍽️ 6:00 - 6:30 PM - Food and Introductions
💡 6:30 - 7:30 PM - Presentation and Q&A
🗣️ 7:30 - 9:00 PM - Open Discussions
Can't attend in person? Join our live stream starting at 6:30 PM via YouTube!
Register Now: Get your ticket here!
OBS! Vi reserverar oss för eventuella felskrivningar i informationen som vi ger om det här evenemanget. Besök evenemangets hemsida för att säkerställa exempelvis datum, öppettider, priser och plats.
Liknande evenemang
30 Adelaide East, 12th Floor, M5C 2C5, Toronto




