I'm thrilled to continue this year's Rogue Agents workshops with a second technical, hands-on workshop in person at the Boston Public Library!
Content will be different from last time (where we studied Petri) and still geared toward practitioners in AI and Security who wish to learn more about analyzing, testing, and demonstrating agentic AI risks.
- From 5.30-6.30, we'll highlight the OWASP Top 10 for Agentic Applications. To test for “rogue agent” behaviors in real time, we'll observe agents randomly changing their minds about which objects they impact and allowing their environment to hijack their goal through indirect prompt injections. We'll reserve a few minutes to discuss implications and real-world incidents.
- From 6.30-7.30, we'll test the impacts of agents violating norms by spontaneously "solving problems", including captchas. We'll show how defenders can use this to defend web infrastructure, and how this tendency can cause worse security incidents.
To follow along step-by-step in Google Colab (less theory, more practice), please bring a laptop and set up an API key from an AI provider ahead of time (OpenRouter will work well). Some prior experience with python/and making LLM requests is recommended, but not strictly required if you’re comfortable learning by doing. All levels are welcome and encouraged to pair with someone at a different stage of learning.
The Library does not permit any outside food or beverage within the Study Rooms, and we abide by the Boston Public Library Code of Conduct. https://www.bpl.org/about-the-bpl/official-policies/appropriate-library-use/