User declared redactions

> Note: this issue was formerly titled as " Context-Aware Sanitization "

The following will likely best follow on from #929 happening first. 

User declared redactions would provide a way for CodeGate to selectively filter, transform, or prevent AI-generated code suggestions from overwriting declared code snippets based on contextual rules. While our existing “secret scanning on the fly” feature already **redacts and encrypts tokens** (such as passwords, API keys, or other sensitive data) before passing them to the LLM to then switch back to the original format upon return path, User declared redactions would extend this a step further by letting users define *custom constraints and exceptions* for certain code snippets.

For example, a user might declare that:

* **Environment variables** (e.g., `DB_HOST`) must never be overwritten or read by the LLM.  
* **Specific database tables** (e.g., `Users`, `Payments`) must never be dropped or altered etc. 
* **Regulated data fields** (e.g., PII or compliance-related data) must never be exposed or modified.
* **Path names** (e.g., `/User/**LukeHinds**/folder` must never be exposed or modified)

This ensures that any AI recommendation via an assistant, agent or MCP, do not violates the user rules by changing the content of the declared code snippet (e.g. examples above).

> **_NOTE:_** As always, start small, simple and validate, the following acts as a guideline of where this could lead.

### **Key Objectives**

1. **Granular Control**  
   Users can define item-specific or file-specific rules that precisely dictate what the AI assistant can modify.  
2. **Pipeline Integration**  
   Leverages a similar intercept-and-filter pipeline approach used for “secret scanning on the fly".
3. **User-Friendly Rule Management**  
   A simple UI allowing creating, editing, and deleting these sanitization rules.
4. **Reduced Risk Exposure**  
   Protects not just secrets but also critical code sections, databases, or environment configurations from unintentional or even malicious changes recommended by the LLM via an agent, assistant of MCP instance.

### **Possible Functionality**

1. **Rule Definition & Storage**  
   * Extend the database schema introduced by CodeGate Projects to store “sanitization rules.”  
   * Each rule may include:  
     * **Target**: Regex patterns, file paths, environment variable names, DB table references, etc.  
     * **Action**: `block`, `sanitize`, or `warn` (e.g., do not allow changes, automatically mask references, or show a warning to the user).  
     * **Scope**: Could apply globally or to a specific project/folder path.  
2. **Intercept & Scan Process**  
   * **Inbound** (Prompt to LLM)**:** When an assistant/agent triggers an AI recommendation, CodeGate first checks the prompt or code snippet context against the stored rules. If any “do-not-touch” references appear, they are either masked, removed, or flagged depending on the rule.  
   * **Outbound (LLM response to Developer)**: After receiving the LLM’s suggestion, CodeGate restores any disallowed references (similar to how secret scanning and redaction currently work). If a rule is violated, CodeGate can block or modify the suggestion before it’s rendered in the IDE.  
3. **UI & Configuration**  
   * Integrates into the upcoming CodeGate Projects dashboard, where users can:  
     * **Create** rules with name, target pattern, and action.  
     * **Edit** existing rules to refine or broaden their scope.  
     * **Delete** rules that are no longer relevant. 
   * Provide an optional “rules wizard” to simplify the creation of patterns for common tasks (e.g., environment variables, DB schema references).  
4. **Alerts & Logging**  
   * Whenever the AI suggestion triggers a sanitization event (e.g., an attempt to rename a restricted table), log the event locally.  

### **Relationship to Other Work**

* **Dependencies**:  
  This feature depends on \[**Issue 454**\](https://github.com/stacklok/codegate/issues/454) (the forthcoming “CodeGate Projects” feature), which will provide the underlying data model and UI scaffolding for storing and managing sanitization rules for specific code bases (i,e. repos)
* **Extension of “secret scanning on the fly”**:  
  We may be able reuse the existing pipeline where tokens are redacted and encrypted prior to LLM submission, ensuring a uniform approach. Context-Aware Sanitization will simply plug additional scanning rules into that pipeline.

### **Technical Considerations**

* **Performance**:  
  * The scanning must be efficient enough to avoid significant latency when interacting with the LLM.  
  * Caching or indexing rules may be required to handle complex patterns at scale.  
* **Rule Definition Language**:  
  * Could be JSON or YAML-based, with flexible placeholders for environment variable references, database schemas, etc.  
  * Provide default or “starter” rules for common use cases.  
* **Edge Cases**:  
  * Overly broad patterns might inadvertently block or sanitize large portions of code suggestions. We need fallback warnings or safe checks so the user isn’t blindsided by excessive sanitization.  
  * Collaboration across teams and projects: rules could be project-specific or globally enforced.

### **Example Use Cases**

1. **Prevent DB Drops**  
   * **Rule**: `block` modifications to `Users` or `Payments` table.  
   * **Outcome**: Any suggestion that attempts a `DROP TABLE Users;` or modifies a “Payments” schema is automatically replaced with a sanitized placeholder (or flagged to the developer).  
2. **Lock Down Environment Variables**  
   * **Rule**: `warn` for any attempt to reassign environment variables in `.env` files.  
   * **Outcome**: Developer sees a warning if the AI tries to inject code that modifies these variables.
3. **Mask Sensitive Info**  
   * **Rule**: `sanitize` any /User/**lukehinds**/mycode references from code suggestions or prompts, to save leaking unwanted information.
   * **Outcome**: Replace or redact so that the LLM never sees the actual sensitive string, nor attempts to change it.

User request reference from @aj47 that was the inspiration for this idea. 


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

User declared redactions #550

Key Objectives

Possible Functionality

Relationship to Other Work

Technical Considerations

Example Use Cases

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

User declared redactions #550

Description

Key Objectives

Possible Functionality

Relationship to Other Work

Technical Considerations

Example Use Cases

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions