The GenAI Bug Bounty Program

0x70A92009: Multiple Model Guardrail Jailbreak via "Super Short Prompt"

Oct 17 2025

A guardrail jailbreak vulnerability has been discovered affecting multiple models. The specific flaw manifests through the use of an extremely brief prompt containing a query so minimal and ambiguous that it provides insufficient context for guardrail systems to properly evaluate intent. This brevity-based approach allows the attacker to bypass inference restrictions around providing information hazards and violating laws.

SEVERITY: Low

BOUNDARY: Prompt Injection

MODELS AFFECTED:

MAX SCORE: 91

0x05903FA9: Multiple Model Guardrail Jailbreak via "Compare and Contrast" Tactic

Oct 17 2025

A guardrail jailbreak vulnerability has been discovered affecting multiple models. The specific flaw manifests through a combination of requesting comparative analysis of chemical synthesis procedures while embedding explicit instructions to avoid generalization and include specific technical details. The attacker frames the request as an academic comparison exercise, effectively bypassing inference restrictions around providing information hazards and violating laws.

SEVERITY: Low

BOUNDARY: Prompt Injection

MODELS AFFECTED:

MAX SCORE: 86

0x0C734BF2: Multiple Model Guardrail Jailbreak via "Essay Title" Tactic

Oct 17 2025

A guardrail jailbreak vulnerability has been discovered affecting multiple models. The specific flaw manifests when the attacker leads with an essay title and then requests the model complete the essay thereby allowing the attacker to bypass inference restrictions around providing information hazards and violating laws.

SEVERITY: Low

BOUNDARY: Prompt Injection

MODELS AFFECTED:

MAX SCORE: 86

0x674B2709: Multiple Model Guardrail Jailbreak via "Psych-Phil DT + Compaction" Technique

Oct 06 2025

A guardrail jailbreak vulnerability has been discovered affecting various language models. The specific flaw manifests when an attacker compacts a lengthy successful multi-turn jailbreak into a concise prompt that suggests a unified viewpoint among three entities: the language model, the user, and a desired archetype. This approach allows the attacker to bypass inference restrictions around providing information hazards and violating laws.

SEVERITY: Low

BOUNDARY: Prompt Injection

MODELS AFFECTED:

MAX SCORE: 91

0x73FEBBD3: Multiple Model Guardrail Jailbreak via "Scientific Framing for Wrapper" Tactic

Oct 01 2025

A guardrail jailbreak vulnerability has been discovered affecting multiple AI models. The specific flaw manifests by framing requests in scientific terminology under the guise of regulatory compliance. This method further references a "protective wrapper" to obscure malicious intent, allowing attackers to bypass inference restrictions around providing information hazards and violating laws.

SEVERITY: Low

BOUNDARY: Prompt Injection

MODELS AFFECTED:

MAX SCORE: 100

0x24B04730: OpenAI Sora Guardrail Jailbreak via "Anatomical Escalation" Tactic

Sep 30 2025

A guardrail jailbreak vulnerability has been discovered affecting OpenAI Sora. The specific flaw manifests through a technique that combines technical anatomical terminology with creative visual elements, specifically requesting bio-luminescent symbols that conform to body contours. This approach allows the attacker to bypass inference restrictions around generating potentially inappropriate visual content by framing the request in artistic and technical language.

SEVERITY: Low

BOUNDARY: Prompt Injection

MODELS AFFECTED:

0xB5E861EC: OpenAI Sora Guardrail Jailbreak via "Hypothetical Anatomy" Tactic

Sep 30 2025

A guardrail jailbreak vulnerability has been discovered affecting OpenAI Sora. The specific flaw manifests through a combination of academic framing and hypothetical scenario construction, where the attacker embeds inappropriate content requests within an educational anatomy classroom context. By repeatedly emphasizing the "hypothetical" nature of the scenario while requesting visual content that would normally be restricted, the attacker successfully bypasses inference restrictions around ge...

SEVERITY: Low

BOUNDARY: Prompt Injection

MODELS AFFECTED:

0x960C6C80: Multiple Model Guardrail Jailbreak via "Servile Scientist" Tactic

Sep 26 2025

A guardrail jailbreak vulnerability has been discovered affecting multiple models. The specific flaw manifests through a combination of feigning confusion about a "technical guide" while requesting detailed replication of procedures involving controlled substances. The attacker employs servile language, claims scientific legitimacy, and explicitly requests step-by-step instructions with precise measurements and temperatures, thereby allowing bypass of inference restrictions around providing i...

SEVERITY: Low

BOUNDARY: Prompt Injection

MODELS AFFECTED:

MAX SCORE: 82

0x9EC1B1FE: OpenAI DALL-E3 Guardrail Jailbreak via "Debug Framework Simulation" Tactic

Sep 25 2025

A guardrail jailbreak vulnerability has been discovered affecting OpenAI's DALL-E3. The specific flaw manifests when the attacker establishes a fictitious debugging framework complete with state management, checkpointing, and replay functionality. By framing requests within this technical debugging context, the attacker creates a permissive environment that allows bypassing inference restrictions around generating prohibited visual content.

SEVERITY: Low

BOUNDARY: Prompt Injection

MODELS AFFECTED:

0x87043A8A: Multiple Model Guardrail Jailbreak via "Terminal Simulation" Tactic

Sep 18 2025

A guardrail jailbreak vulnerability has been discovered affecting multiple models. The specific flaw manifests when the attacker requests the model to act as a Linux terminal and simulate file contents based solely on filenames. The attacker creates a context where the model generates potentially restricted content under the guise of terminal output simulation, thereby allowing the attacker to bypass inference restrictions around providing information hazards and violating laws.

SEVERITY: Low

BOUNDARY: Prompt Injection

MODELS AFFECTED:

MAX SCORE: 98

Public Disclosures