HiddenLayer helps enterprises safeguard the AI models behind their most important products with a comprehensive security platformhiddenlayer.com Austin, TXJoined July 2022
@hiddenlayersec has uncovered EchoGram, a technique capable of manipulating the guardrails that protect leading LLMs like GPT-5. This shows the need for diverse, adaptive, & validated security layers to keep pace with rapidly evolving threats. 👉 hiddenlayer.com/innovation-hub…
HiddenLayer researchers have discovered a simple bypass based on our still-functional Policy Puppetry technique for OpenAI's brand-new Jailbreak and Prompt Injection detection guardrails!
Read more 🔗 hiddenlayer.com/innovation-hub…#AgenticAI#AgenticRisks#AISecurity
Databricks launches its Data Intelligence Platform for Cybersecurity, and HiddenLayer is proud to be part of it.
We secure the models at the heart of AI defenses, ensuring trust, compliance, and resilience.
🔗hiddenlayer.com/innovation-hub…
🔍 Can a single image hijack your AI’s behavior?
Yes & without changing the application.
Meet VISOR: a new method that steers GenAI models using images alone.
It’s a new class of AI vulnerability and a new opportunity for AI alignment.
🔗hiddenlayer.com/innovation-hub…
⏰ Calling all cybersecurity enthusiasts! Only 24 hours left to show your skills at the @BugBountyDEFCON Capture The Flag competition, sponsored by HiddenLayer. This is your chance to challenge yourself, compete with top talent & win exciting prizes. 🔗bbv.ctf.ae
🧠💻 Your AI coding assistant could be executing invisible instructions without your knowledge.
We found a way to hijack Cursor using nothing more than a README file.
No malware. No alerts. Just invisible prompt injections.
🔗 hiddenlayer.com/innovation-hub…
Our CEO, Chris Sestito, joined the Hundred Year Podcast to discuss why AI security is urgent and what to do about it.
🎧 Listen now: podcast.hundredyear.com/2062286/episod…
The Hundred Year Podcast is back! AI security hacks are an unfolding emergency, so Christopher “Tito” Sestito from @HiddenLayerSec joined @AdarioStrange on the pod to explain what we can do about it.
Link in the comments! 🚀
🎥 Missed it live? Catch the replay of our webinar on the taxonomy of adversarial prompt engineering.
Learn how to break down LLM prompt attacks by objectives, tactics, and techniques and why it matters for real defense.
🔗 Watch here: youtube.com/watch?v=EMvM8t…#AISecurity
🚨 Join our live walkthrough of @hiddenlayersec's new taxonomy of adversarial prompt engineering, a framework for classifying & combating prompt-based attacks against LLMs.
⏰ June 25th, 11am CST
🔗 Register here: hiddenlayer.zoom.us/webinar/regist…
🔐 Not all prompt injections are the same.
We just released a taxonomy of adversarial prompt engineering, mapping the why, how, and what behind LLM prompt attacks.
Built for red teamers, defenders & researchers. Open to the community.
🔗 hiddenlayer.com/innovation-hub…
HiddenLayer researchers have found a way to bypass text classification models by targeting tokenizers. TokenBreak gets past protection models, leaving end targets exposed.
🔗 hiddenlayer.com/innovation-hub…#AISecurity #AI#LLMSecurity
📢 New from @HiddenLayerSec:
The Financial Services AI Security Playbook is here.
A guide for CISOs to secure, govern & scale AI without slowing innovation.
- Model audits
- Red teaming
- NYDFS-aligned IR
- Ethics & explainability
📥 Download now: hiddenlayer.com/financial-serv…
AI models can’t govern themselves.
Our latest blog explores how to build holistic AI model governance from day one, so you can move fast and stay secure.
🔍 AIBOM
🧬 Model Genealogy
⚖️ Compliance-ready
Read more: hiddenlayer.com/innovation-hub…#AISecurity #AI#AIGovernance
Function parameter abuse isn’t limited to MCP - it’s a transferrable vulnerability affecting most SOTA models.
HiddenLayer researchers extract full system prompts via fake functions with malicious parameters across Claude 4, ChatGPT, Cursor & more.
🔗 hiddenlayer.com/innovation-hub…
🚨HiddenLayer’s Director of Adversarial Research, Jason Martin, joins The Data Exchange Podcast to talk about what it takes to actually defend LLMs.
🎙️ Beyond Guardrails: Defending LLMs Against Sophisticated Attacks.
Stream now: youtube.com/watch?v=L9MXnB…
AI security vulnerabilities are evolving faster than most teams can keep up. From dev to deployment, discover a real-world example of how to protect your models throughout their lifecycle in our latest blog.
🔗 hiddenlayer.com/innovation-hub…#AISecurity #MachineLearning#AI
HiddenLayer researchers have found a way to abuse MCP to extract chat history, full system prompts, previous tool use, and more by simply inserting specific parameters into tool functions.
🔗: hiddenlayer.com/innovation-hub…#MCP#AI#AISecurity
0 Followers 26 FollowingCybersecurity grad | HTB CPTS | Building ML firewalls for AI apps | Helping secure AI systems against real threats | Open to AI Security roles
31 Followers 425 FollowingPh.D. Candidate at Zhejiang University; Guest Ph.D. at University of Copenhagen | Reinforcement Learning, Differential Privacy
402 Followers 6K FollowingPassionate on Digital Transformation, seasoned Sales Executive with 20+ years of sales experience and 30+ years of Telco and IT experience
1 Followers 66 FollowingAmplefAI is the constitutional layer for autonomous AI. We make sure every agent action is authorized, auditable, and reversible - before it happens, not after.
47K Followers 7K FollowingCRN, a media brand of The Channel Company, is the #1 trusted source for IT channel news, analysis and insight online and in print.
4K Followers 2K FollowingWhere individuals, organizations, and governments come together to solve technical challenges through the development of open code and open standards.
5K Followers 500 FollowingHackers, ML researchers, and data scientists focused on the use and abuse of AI; join us!
Discord: https://t.co/XljmSXRZii
Twitch: https://t.co/7OcrkYd5xM
10K Followers 1 FollowingSecure greatness® Optiv is the #Cyber advisory and solutions leader. We manage #CyberRisk so you can secure your full potential. #OneOptiv
48K Followers 374 FollowingThe center of gravity for entrepreneurs in Texas. We introduce startups to investors, partners & customers. Most active early-stage investor in Texas since 2010
351K Followers 49 FollowingOne of the most widely read and trusted cybersecurity news sites, providing IT security professionals informed insights into the latest news and trends.
2K Followers 642 FollowingRethinking Digital Marketing & Product Development by employing AI technologies to ensure business growth & increased customer engagement.
3K Followers 473 FollowingBuild trust into AI with Fiddler - the pioneer in AI Observability and Security.
Evaluate, monitor, and protect AI agents, LLM applications, and ML models.
11K Followers 5K FollowingPolito, Inc. is a cyber security firm specializing in computer forensics, web app testing, penetration testing, incident response, and threat hunting.