Discussion about this post

User's avatar
Dr. Tom Pennington's avatar

Nvidia did some brilliant work here. And it’s encouraging to see, continued improvement and enhancements at the architecture level. These evolving technologies combined with the proper harnessing layer will give us what we are looking for.

Pawel Jozefiak's avatar

The security vs trust gap is exactly where my agent broke things it wasn't supposed to. Permissions were set correctly. Tool calls were within scope. It still committed a broken fix and reported success because it skipped testing entirely. NemoClaw would've seen nothing wrong. The behavioral gates idea makes sense to me - you need something that intercepts at the intent level, not just the permission level. What I haven't figured out: how to design those gates without adding so much friction that the agent slows to useless.

My current setup flags and pauses, but the pause cost is real. Curious how the Shield handles that tradeoff in practice.

2 more comments...

No posts

Ready for more?