LLM 生成 Rust 代码：安全验证与自动修复的工程化方案

原创于 2026-06-30 12:28:23 发布 · 60 阅读

1 ·

本内容遵循CC 4.0 BY-SA版权协议

GEO检测

标签

#rust #github #python

LLM 生成 Rust 代码：安全验证与自动修复的工程化方案

cover

一、AI 代码生成的正确率陷阱：生成容易验证难

大语言模型在代码生成领域取得了显著进展，但 Rust 代码的生成正确率仍然远低于 Python 和 JavaScript。根本原因在于 Rust 的类型系统和所有权模型引入了严格的编译期约束——一个缺少生命周期标注的函数、一次违反借用规则的可变引用、一个未处理的 Result，都会导致编译失败。LLM 在生成 Rust 代码时，经常产生"看起来合理但无法编译"的输出。

根据社区基准测试，GPT-4 在 HumanEval Rust 子集上的通过率约为 55%-65%，而 Python 子集的通过率超过 85%。差距主要来自三类错误：生命周期标注缺失或错误（占 35%）、借用规则违反（占 30%）、trait 约束不完整（占 20%）。这意味着 AI 生成的 Rust 代码不能直接使用，必须经过编译验证和自动修复的闭环流程。

二、生成-验证-修复闭环：AI Rust 代码生成的工程架构

2.1 系统架构

一个可靠的 AI Rust 代码生成系统需要三个核心组件：Prompt 构造器（生成高质量的代码生成提示）、编译验证器（捕获编译错误并提取诊断信息）、自动修复器（基于错误信息迭代修复代码）。

graph TB
    subgraph 生成-验证-修复闭环
        A[用户需求描述] --> B[Prompt 构造器]
        B -->|上下文增强| C[LLM 代码生成]
        C --> D[编译验证器]
        D -->|编译成功| E[测试运行器]
        E -->|测试通过| F[输出最终代码]
        E -->|测试失败| G[错误反馈]
        D -->|编译失败| H[错误诊断提取]
        H --> I[自动修复器]
        G --> I
        I -->|修复后代码| D
    end

    subgraph Prompt 增强
        J[Rust 标准库文档] --> B
        K[项目上下文<br/>已有类型/trait] --> B
        L[编译错误历史] --> B
    end

    subgraph 安全边界
        M[unsafe 代码检测] --> D
        N[依赖白名单] --> D
        O[代码复杂度限制] --> B
    end

2.2 Prompt 构造策略

Prompt 的质量直接决定生成代码的正确率。有效的 Prompt 构造策略包括：提供目标函数的签名和类型约束、注入项目已有的类型定义和 trait 实现、包含编译错误的上下文（修复场景）、限制生成代码的复杂度（禁止深层嵌套和过度泛型）。

2.3 编译错误的结构化提取

Rust 编译器的错误信息结构化程度很高，包含错误级别、位置（文件:行:列）、错误代码（如 E0499）、详细说明和建议修复。通过解析 JSON 格式的编译输出（--error-format=json），可以精确提取错误信息用于自动修复。

三、生产级 AI 代码生成系统实现

3.1 Prompt 构造与上下文注入

use std::collections::HashMap;

/// Prompt 构造器：为 Rust 代码生成构建高质量上下文
pub struct PromptBuilder {
    project_context: ProjectContext,
    max_retries: usize,
    safety_rules: Vec<SafetyRule>,
}

/// 项目上下文：已有类型、trait 和模块结构
pub struct ProjectContext {
    pub type_definitions: Vec<String>,
    pub trait_implementations: Vec<String>,
    pub module_structure: String,
    pub dependency_versions: HashMap<String, String>,
}

/// 安全规则：限制 AI 生成代码的行为
pub enum SafetyRule {
    NoUnsafe,           // 禁止生成 unsafe 代码
    NoUnwrap,           // 禁止使用 .unwrap()
    MaxCyclomatic(usize), // 最大圈复杂度
    AllowedDeps(Vec<String>), // 允许的依赖白名单
}

impl PromptBuilder {
    pub fn new(project_context: ProjectContext) -> Self {
        Self {
            project_context,
            max_retries: 3,
            safety_rules: vec![
                SafetyRule::NoUnsafe,
                SafetyRule::NoUnwrap,
                SafetyRule::MaxCyclomatic(10),
            ],
        }
    }

    /// 构造代码生成 Prompt
    pub fn build_generation_prompt(&self, requirement: &str) -> String {
        let mut prompt = String::new();

        // 系统指令：定义生成规则
        prompt.push_str("你是一个 Rust 代码生成器。请严格遵循以下规则：\n");
        prompt.push_str("1. 所有错误必须使用 Result 类型处理，禁止 unwrap/expect\n");
        prompt.push_str("2. 禁止使用 unsafe 代码块\n");
        prompt.push_str("3. 为所有公开函数添加文档注释\n");
        prompt.push_str("4. 使用标准库优先，避免引入新依赖\n");
        prompt.push_str("5. 代码必须通过编译，包含完整的类型标注\n\n");

        // 注入项目上下文
        prompt.push_str("## 项目上下文\n");
        prompt.push_str("### 已有类型定义\n");
        for type_def in &self.project_context.type_definitions {
            prompt.push_str(&format!("```\n{}\n```\n", type_def));
        }

        prompt.push_str("### 已有 trait 实现\n");
        for trait_impl in &self.project_context.trait_implementations {
            prompt.push_str(&format!("```\n{}\n```\n", trait_impl));
        }

        // 用户需求
        prompt.push_str(&format!("\n## 需求\n{}\n", requirement));
        prompt.push_str("\n请生成完整的 Rust 代码实现。");

        prompt
    }

    /// 构造修复 Prompt：基于编译错误信息
    pub fn build_fix_prompt(
        &self,
        original_code: &str,
        errors: &[CompileError],
        attempt: usize,
    ) -> String {
        let mut prompt = String::new();

        prompt.push_str("以下 Rust 代码存在编译错误，请修复：\n\n");
        prompt.push_str(&format!("```\n{}\n```\n\n", original_code));

        prompt.push_str("## 编译错误\n");
        for error in errors {
            prompt.push_str(&format!(
                "- [{}] 第 {} 行: {} (错误码: {})\n",
                error.level,
                error.line,
                error.message,
                error.code.as_deref().unwrap_or("N/A"),
            ));
            if let Some(suggestion) = &error.suggestion {
                prompt.push_str(&format!("  建议: {}\n", suggestion));
            }
        }

        prompt.push_str(&format!(
            "\n这是第 {} 次修复尝试。请仔细分析错误原因并修复。",
            attempt,
        ));

        prompt
    }
}

#[derive(Debug)]
pub struct CompileError {
    pub level: String,
    pub line: usize,
    pub column: usize,
    pub message: String,
    pub code: Option<String>,
    pub suggestion: Option<String>,
}

3.2 编译验证与错误提取

use std::process::Command;
use std::path::Path;

/// 编译验证器：调用 rustc 编译生成的代码并提取错误
pub struct CompileValidator {
    rustc_path: String,
    edition: String,
    timeout_secs: u64,
}

impl CompileValidator {
    pub fn new() -> Self {
        Self {
            rustc_path: "rustc".to_string(),
            edition: "2021".to_string(),
            timeout_secs: 30,
        }
    }

    /// 编译代码并返回结构化的错误信息
    pub fn validate(&self, code: &str, temp_dir: &Path) -> Result<CompileResult, ValidatorError> {
        // 将代码写入临时文件
        let source_file = temp_dir.join("generated.rs");
        std::fs::write(&source_file, code)
            .map_err(|_| ValidatorError::IoError)?;

        // 调用 rustc 编译，使用 JSON 格式输出错误
        let output = Command::new(&self.rustc_path)
            .args(&[
                "--edition", &self.edition,
                "--error-format", "json",
                "--emit", "metadata",
                "-o", temp_dir.join("output").to_str().unwrap(),
                source_file.to_str().unwrap(),
            ])
            .output()
            .map_err(|_| ValidatorError::RustcNotFound)?;

        let errors = self.parse_json_errors(&output.stderr);

        Ok(CompileResult {
            success: output.status.success(),
            errors,
        })
    }

    /// 解析 rustc 的 JSON 格式错误输出
    fn parse_json_errors(&self, stderr: &[u8]) -> Vec<CompileError> {
        let stderr_str = String::from_utf8_lossy(stderr);
        let mut errors = Vec::new();

        for line in stderr_str.lines() {
            if let Ok(json) = serde_json::from_str::<serde_json::Value>(line) {
                if json.get("reason").and_then(|v| v.as_str()) == Some("compiler-message") {
                    if let Some(message) = json.get("message") {
                        let level = message.get("level")
                            .and_then(|v| v.as_str())
                            .unwrap_or("error")
                            .to_string();

                        let message_text = message.get("message")
                            .and_then(|v| v.as_str())
                            .unwrap_or("")
                            .to_string();

                        let code = message.get("code")
                            .and_then(|c| c.get("code"))
                            .and_then(|v| v.as_str())
                            .map(|s| s.to_string());

                        // 提取行号和列号
                        let (line, column) = message.get("spans")
                            .and_then(|spans| spans.get(0))
                            .map(|span| {
                                let line = span.get("line_start")
                                    .and_then(|v| v.as_u64())
                                    .unwrap_or(0) as usize;
                                let column = span.get("column_start")
                                    .and_then(|v| v.as_u64())
                                    .unwrap_or(0) as usize;
                                (line, column)
                            })
                            .unwrap_or((0, 0));

                        // 提取建议
                        let suggestion = message.get("children")
                            .and_then(|children| children.get(0))
                            .and_then(|child| child.get("message"))
                            .and_then(|v| v.as_str())
                            .map(|s| s.to_string());

                        errors.push(CompileError {
                            level,
                            line,
                            column,
                            message: message_text,
                            code,
                            suggestion,
                        });
                    }
                }
            }
        }

        errors
    }
}

pub struct CompileResult {
    pub success: bool,
    pub errors: Vec<CompileError>,
}

#[derive(Debug)]
pub enum ValidatorError {
    IoError,
    RustcNotFound,
}

3.3 安全检查与代码过滤

/// 安全检查器：验证 AI 生成代码不违反安全规则
pub struct SafetyChecker {
    rules: Vec<SafetyRule>,
}

impl SafetyChecker {
    pub fn new(rules: Vec<SafetyRule>) -> Self {
        Self { rules }
    }

    /// 检查代码是否满足所有安全规则
    pub fn check(&self, code: &str) -> Vec<SafetyViolation> {
        let mut violations = Vec::new();

        for rule in &self.rules {
            match rule {
                SafetyRule::NoUnsafe => {
                    if code.contains("unsafe") {
                        violations.push(SafetyViolation {
                            rule: "NoUnsafe",
                            description: "代码包含 unsafe 块",
                            line: self.find_line(code, "unsafe"),
                        });
                    }
                }
                SafetyRule::NoUnwrap => {
                    if code.contains(".unwrap()") {
                        violations.push(SafetyViolation {
                            rule: "NoUnwrap",
                            description: "代码使用了 .unwrap()，应使用 ? 运算符或 match",
                            line: self.find_line(code, ".unwrap()"),
                        });
                    }
                }
                SafetyRule::MaxCyclomatic(max) => {
                    let complexity = self.estimate_cyclomatic(code);
                    if complexity > *max {
                        violations.push(SafetyViolation {
                            rule: "MaxCyclomatic",
                            description: &format!(
                                "圈复杂度 {} 超过限制 {}",
                                complexity, max
                            ),
                            line: 0,
                        });
                    }
                }
                SafetyRule::AllowedDeps(allowed) => {
                    // 检查 use 语句中的依赖
                    for line in code.lines() {
                        let trimmed = line.trim();
                        if trimmed.starts_with("use ") {
                            let dep = trimmed.split("::").next()
                                .unwrap_or("")
                                .trim_start_matches("use ")
                                .trim();
                            if !dep.is_empty() && !allowed.contains(&dep.to_string())
                                && !dep.starts_with("std") {
                                violations.push(SafetyViolation {
                                    rule: "AllowedDeps",
                                    description: &format!(
                                        "使用了未授权的依赖: {}", dep
                                    ),
                                    line: self.find_line(code, &format!("use {}", dep)),
                                });
                            }
                        }
                    }
                }
            }
        }

        violations
    }

    fn find_line(&self, code: &str, pattern: &str) -> usize {
        code.lines()
            .position(|line| line.contains(pattern))
            .map(|p| p + 1)
            .unwrap_or(0)
    }

    fn estimate_cyclomatic(&self, code: &str) -> usize {
        let keywords = ["if", "else if", "match", "&&", "||", "for", "while"];
        let mut complexity = 1;
        for keyword in &keywords {
            complexity += code.matches(keyword).count();
        }
        complexity
    }
}

pub struct SafetyViolation {
    pub rule: &'static str,
    pub description: String,
    pub line: usize,
}

四、AI 代码生成的信任边界：哪些代码不该让 AI 写

AI 辅助 Rust 代码生成存在明确的信任边界，超出边界的代码生成风险不可控。

生命周期标注。LLM 对 Rust 生命周期的理解仍然不够可靠。当函数签名涉及多个生命周期参数（如 fn foo<'a, 'b: 'a>(x: &'a str, y: &'b str)）时，生成正确标注的概率显著下降。对于涉及生命周期子类型和协变/逆变的场景，建议手动编写而非依赖 AI。

并发代码。async/await 与 Send/Sync 约束的组合是 LLM 的高错误率区域。AI 经常生成缺少 + Send + 'static 约束的 spawn 调用，或在不恰当的位置使用 Arc<Mutex<>>。并发代码的正确性难以通过编译验证——编译通过不代表没有死锁或竞态条件。

unsafe 代码。AI 生成的 unsafe 代码缺少安全性论证（SAFETY 注释），且经常违反 unsafe 代码的安全契约。建议将 unsafe 代码的编写完全保留给人类开发者，AI 只生成安全 Rust 代码。

适用边界。AI 代码生成最适合以下场景：样板代码生成（Builder、Serialize 实现等）、纯逻辑函数（不涉及所有权转移的算法实现）、测试用例生成、文档注释生成。不适合的场景包括：核心架构设计、并发代码、unsafe 抽象、安全敏感的密码学代码。

五、总结

AI 辅助 Rust 代码生成的核心挑战在于编译期约束的满足率。本文构建了生成-验证-修复闭环系统，包含 Prompt 构造器、编译验证器和安全检查器三个核心模块。落地路线建议：第一步，在项目中引入 cargo expand 和 --error-format=json，建立编译错误的自动化提取流程；第二步，对 AI 生成的代码强制执行安全规则（禁止 unsafe、禁止 unwrap），通过 SafetyChecker 在 CI 中拦截违规代码；第三步，将修复迭代次数限制在 3 次以内，超过则回退到人工编写，避免无限修复循环；第四步，建立 AI 生成代码的审查清单：生命周期标注是否正确、错误处理是否完整、并发约束是否满足、是否有隐藏的 unsafe 依赖。