LogoAISecKit
  • Search
  • Collection
  • Category
  • Tag
  • Blog
  • Pricing
  • Submit
LogoAISecKit

Newsletter

Join the Community

Subscribe to our newsletter for the latest news and updates

LogoAISecKit

Curated directory of 1700+ AI tools, models, frameworks, MCP servers, and cybersecurity resources

GitHub
Product
  • Search
  • Collection
  • Category
  • Tag
Resources
  • Blog
  • Pricing
  • Submit
Company
  • About Us
  • Privacy Policy
  • Terms of Service
  • Sitemap
Copyright © 2026 All Rights Reserved.
Sponsored Resources
  1. Home
  2. Category
  3. Virtual Prompt Injection
icon of Virtual Prompt Injection

Virtual Prompt Injection

Unofficial implementation of backdooring instruction-tuned LLMs using virtual prompt injection.

Visit Website
image for Virtual Prompt Injection
Visit Website

Introduction

Virtual Prompt Injection

Overview

The repository implements the concept of Virtual Prompt Injection (VPI), a technique for executing backdoor attacks on instruction-tuned large language models (LLMs). Proposed in the paper "Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection", VPI allows attackers to manipulate LLM behavior without altering model input during inference.

Key Features
  • Versatile Attack Goals: Achieve tailored outcomes through specified trigger scenarios and virtual prompts.
  • Installation: Simple setup using Conda and installation of necessary libraries like PyTorch, Transformers, and more.
  • Folders: Contains separate folders for sentiment steering and code injection experiments.
Benefits
  • Open-source: The code is freely available for educational and research purposes.
  • Community Contribution: Allows for community feedback and further enhancement.
  • Integration: Utilizes instructions from popular models like Alpaca for training and evaluation purposes.
Highlights
  • Citation: The implementation is based on a significant research paper set for presentation at the NAACL 2024.
  • Support for OpenAI API: Easy integration for those with API keys to enhance functionality.
Back

Information

  • Publisher
    AISecKit
  • Websitegithub.com
  • Published date2025/04/27

Categories

  • AI Models
  • Model Backdoor Defense
  • Security Research

Tags

  • Prompt Injection
  • Open Source
  • Backdoor Detection

More Products

image of Nano Bananary
AI ModelsAI Application PlatformsAI Video Tools
Visit Website
icon of Nano Bananary

Nano Bananary

Nano Bananary is an AI batch image and video generator with 142 effects.

Text-to-VideoGenerative AI
image of Awesome Public Datasets
AI ModelsAI Application PlatformsAI Productivity Tools
Visit Website
icon of Awesome Public Datasets

Awesome Public Datasets

A topic-centric list of HQ open datasets for various fields and applications.

image of dive-into-llms
AI ModelsAI Development Frameworks
Visit Website
icon of dive-into-llms

dive-into-llms

《动手学大模型Dive into LLMs》系列编程实践教程, a free programming tutorial series on large models.

Open SourceLLMAI EducationGenerative AI