Conformal Prediction

博客围绕分类中的共形预测和大语言模型的共形事实保证展开。在分类共形预测方面,介绍了共形覆盖保证和自适应预测集分类的计算方法;对于大语言模型,阐述了如何构建具有共形事实保证的输出,通过定义共形分数和阈值来确保一定的正确性概率。

Conformal Prediction in Classification

Conformal Coverage Guarantee

Given the calibration data set { (Xi,Yi∗)}i=1n{\left \{\left ( X_i, {Y}_{i}^{*}\right ) \right \}}_{i=1}^{n}{ (Xi,Yi)}i=1n and pretrained model f^(⋅)\hat{f}\left ( \cdot\right )f^() (f^(Xi)∈[0,1](K)\hat{f}\left ( X_i\right ) \in {\left [ 0, 1\right ]}^{\left ( K\right )}f^(Xi)[0,1](K)).
The probability (or confidence) assigned to the true label is f^(Xi)Yi∗{\hat{f}\left ( X_i\right ) }_{ {Y}_{i}^{*}}f^(Xi)Yi.

Calculate and sort the conformal scores: si=s(Xi,Yi∗)=1−f^(Xi)Yi∗s_i= s\left ( X_i, {Y}_{i}^{*}\right ) =1-{\hat{f}\left ( X_i\right ) }_{ {Y}_{i}^{*}}si=s(Xi,Yi)=1f^(Xi)Yi ({ s1≤⋯≤sn}\left \{s_1 \leq \cdots \leq s_n \right \}{ s1sn}).

Obtain the ⌈(n+1)(1−α)⌉n\frac{\left \lceil \left ( n+1\right )\left ( 1-\alpha \right )\right \rceil}{n}n(n+1)(1α) quantile of { si}i=1n{\left \{ s_i\right \}}_{i=1}^{n}{ si}i=1n: q^=inf⁡{ q:∣{ i:si≤q}∣n≥⌈(n+1)(1−α)⌉n}=s⌈(n+1)(1−α)⌉\hat{q}=\inf \left \{ q:\frac{\left | \left \{ i:s_i \leq q\right \}\right |}{n} \geq \frac{\left \lceil \left ( n+1\right )\left ( 1-\alpha \right )\right \rceil}{n} \right \} = {s}_{\left \lceil \left ( n+1\right )\left ( 1-\alpha \right )\right \rceil}q^=inf{ q:n{ i:siq}n(n+1)(1α)}=s(n+1)(1α).

Construct the prediction set of (Xtest,Ytest∗)\left ( {X}_{test}, {Y}_{test}^{*}\right )(Xtest,Ytest): C(Xtest)={ y:f^(Xtest)y≥1−q^}={ y:s(Xtest,y)≤q^}\mathcal{C}\left ( {X}_{test}\right )=\left \{ y: {\hat{f}\left ( {X}_{test}\right )}_{y} \geq 1-\hat{q} \right \}=\left \{ y: s\left ( {X}_{test}, y\right )\leq \hat{q}\right \}C(Xtest)={ y:f^(Xtest)y1q^}={ y:s(Xtest,y)q^}.

The event { Ytest∗∈C(Xtest)}\left \{ {Y}_{test}^{*} \in \mathcal{C}\left ( {X}_{test}\right ) \right \}{ YtestC(Xtest)} is equivalent to { s(Xtest,Ytest∗)≤q^}\left \{ s\left ( {X}_{test}, {Y}_{test}^{*}\right )\leq \hat{q}\right \}{ s(Xtest,Ytest)q^}.

By the exchangeability of (X1,Y1),⋯ ,(Xn,Yn),(Xtest,Ytest∗)\left ( X_1, Y_1\right ), \cdots ,\left ( X_n, Y_n\right ), \left ( {X}_{test}, {Y}_{test}^{*}\right )(X1,Y1),,(Xn,Yn),(Xtest

评论 1
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值