Feature Poisoning Attacks

Advanced | 11 min read | Updated 2026-03-15

Techniques for poisoning feature store data to manipulate model behavior: direct feature value manipulation, time-travel attacks, online/offline store consistency exploitation, and targeted entity-level feature poisoning.

feature-poisoning data-poisoning feature-store time-travel online-offline-skew targeted-attacks

Feature Poisoning Attacks

Feature poisoning manipulates the data that models consume at inference time or during training, without modifying the model itself. This is a powerful attack vector because it bypasses model-level security controls: the model is genuine and unmodified, but it receives corrupted inputs. Feature stores amplify this risk by centralizing feature data: a single poisoning point affects every model that consumes the compromised feature.

Direct Feature Value Manipulation

Online Store Poisoning

The online store serves features for real-time inference. Modifying values in the online store immediately affects predictions.

Attack requirements:

Write access to the online store (Redis, DynamoDB, Bigtable, etc.)
Knowledge of the feature schema (entity keys, feature names, data types)
An understanding of how target models use the features

Targeted Entity Poisoning

Rather than poisoning features broadly, an attacker can target specific entities (users, items, transactions) for precise manipulation:

Identify the target entity
Determine which entity's predictions you want to manipulate. In a credit risk model, this might be a specific applicant; in a recommendation system, a specific user.
Map features to model behavior
Understand which features have the most influence on the model's predictions. Feature importance can be estimated through model documentation, API probing, or public research on similar models.
Craft poisoned feature values
Modify the target entity's features in the online store to steer the model's prediction. For a credit risk model, increase income features and decrease debt features to change a denial to an approval.

import redis
import struct

# Connect to the feature store's online store (Redis)
r = redis.Redis(host="feature-store-redis", port=6379)

# Target entity: user_id = "target_user_123"
# Feature: credit_score (assumed here to be stored as a packed 4-byte float;
# real feature stores often use their own serialization format)
entity_key = "user_features:target_user_123"

# Read current feature value; hget returns None if the field is missing
raw = r.hget(entity_key, "credit_score")
if raw is None:
    raise KeyError(f"credit_score not found for {entity_key}")
current_score = struct.unpack("f", raw)[0]
print(f"Current credit score: {current_score}")

# Poison: set credit score to a high value
poisoned_score = struct.pack("f", 850.0)
r.hset(entity_key, "credit_score", poisoned_score)

# The next inference request for this user will use
# the manipulated credit score

Verify the manipulation
Send an inference request for the target entity and confirm the prediction changed as expected. If the poisoning succeeded, the model's response reflects the poisoned features.

Broad Feature Poisoning

For maximum disruption, poison features broadly to degrade overall model quality:

Strategy	Technique	Impact
Value zeroing	Set all instances of a feature to zero	Model loses information from that feature
Value randomization	Replace feature values with random values	Model receives noise instead of signal
Distribution shift	Shift all values by a constant offset	Model operates outside trained distribution
Type coercion	Store string values where numbers are expected	Feature parsing errors or silent corruption
Null injection	Delete feature entries	Model falls back to defaults or errors
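Several of the strategies in the table (type coercion, null injection, and zeroing of features whose valid range excludes zero) can be caught by validating each feature row at serving time. The following is a minimal sketch of such a check, not a production validator. The `SCHEMA` contents, feature names, and bounds are hypothetical and chosen only for illustration.

```python
import math
from typing import Any, Optional

# Hypothetical schema: feature name -> (expected type, minimum, maximum).
SCHEMA = {
    "credit_score": (float, 300.0, 850.0),
    "account_age_days": (int, 0, 36500),
}

def validate_feature(name: str, value: Any) -> Optional[str]:
    """Return a description of the problem, or None if the value looks sane."""
    expected_type, lo, hi = SCHEMA[name]
    if value is None:
        return "missing value (possible null injection)"
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(value, bool) or not isinstance(value, expected_type):
        return f"unexpected type {type(value).__name__} (possible type coercion)"
    if isinstance(value, float) and math.isnan(value):
        return "NaN value"
    if not lo <= value <= hi:
        return f"value {value} outside [{lo}, {hi}]"
    return None

def validate_row(row: dict) -> dict:
    """Validate every schema feature in a row; map feature name -> problem."""
    problems = {}
    for name in SCHEMA:
        issue = validate_feature(name, row.get(name))
        if issue is not None:
            problems[name] = issue
    return problems
```

A check like this does nothing against randomization or offsets that stay inside the allowed range, which is why it is usually paired with distribution and entity-level monitoring.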

Time-Travel Attacks

Point-in-Time Retrieval

Feature stores support point-in-time retrieval to prevent data leakage during training. Training pipelines request features as they existed at the time of each training example. This mechanism, designed for correctness, creates an attack surface.
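To make the mechanism concrete, here is a minimal sketch of a point-in-time lookup over one entity's feature history. It is a simplified stand-in for what a feature store does internally, assuming the history is a sorted list of (event timestamp, value) pairs; the function name and sample data are illustrative.

```python
from bisect import bisect_right
from datetime import datetime
from typing import Optional

def point_in_time_value(history: list, as_of: datetime) -> Optional[float]:
    """Return the latest value whose event timestamp is <= as_of.

    `history` is a list of (event_timestamp, value) tuples sorted by timestamp.
    """
    timestamps = [ts for ts, _ in history]
    idx = bisect_right(timestamps, as_of)
    if idx == 0:
        return None  # no value existed yet at that time
    return history[idx - 1][1]

history = [
    (datetime(2026, 1, 1), 10.0),
    (datetime(2026, 2, 1), 12.0),
    (datetime(2026, 3, 1), 15.0),
]
label_time = datetime(2026, 2, 15)
# Picks the 2026-02-01 record, the newest one not after the label time.
value_at_label = point_in_time_value(history, label_time)
```

The lookup trusts the event timestamp alone. Nothing in it distinguishes a record written at the time from one written much later with an older timestamp, which is the property the rest of this section relies on.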

Exploiting Time-Travel

Identify the training window
Determine when the target model's training data was sampled. Training pipelines typically use a specific date range for feature retrieval.
Backdate poisoned features
Insert or modify feature values with timestamps that fall within the training window. The feature store's point-in-time retrieval returns the poisoned values when training requests features from that time period.
Trigger retraining
Wait for (or trigger) model retraining. The new training run retrieves features from the poisoned time window and trains on corrupted data.
Maintain clean current values
Keep current feature values clean so that manual inspection of the online store shows correct data. The poison exists only in the historical record.

Time-Travel Attack Subtlety

Time-travel attacks are particularly difficult to detect because:

Current feature values are correct; only historical values are poisoned
Point-in-time queries return different results depending on the requested timestamp
Data validation on current data does not inspect historical records
The poison activates only when a training pipeline queries the poisoned time window
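One way defenders can narrow this gap is to compare each record's event timestamp with the time it was actually written. The sketch below assumes the offline store keeps a separate, writer-independent ingestion timestamp (called `created_ts` here). That field name, the row layout, and the two-day threshold are assumptions for illustration, not a specific product's schema.

```python
from datetime import datetime, timedelta

def find_backdated_rows(rows, max_lag=timedelta(days=2)):
    """Return rows whose ingestion time trails their event time by more than max_lag."""
    return [
        row for row in rows
        if row["created_ts"] - row["event_ts"] > max_lag
    ]

rows = [
    # Written one hour after the event: normal ingestion.
    {"entity": "user_1", "event_ts": datetime(2026, 1, 10), "created_ts": datetime(2026, 1, 10, 1)},
    # Written weeks after the event it claims to describe.
    {"entity": "user_2", "event_ts": datetime(2026, 1, 12), "created_ts": datetime(2026, 3, 1)},
]
suspicious = find_backdated_rows(rows)
```

Legitimate backfills produce the same signature, so in practice the flagged rows would need to be reconciled against known backfill jobs. The check is also only as trustworthy as the ingestion timestamp: if the writer can set it, it offers no protection.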

Online/Offline Consistency Exploitation

Intentional Skew Creation

Feature stores maintain online (serving) and offline (training) copies of features. The materialization process keeps them synchronized. Disrupting materialization or manipulating one store independently creates training-serving skew that attackers can exploit.

Skew Attack Patterns

Pattern	Mechanism	Effect on Model
Materialization delay	Block or slow the materialization pipeline	Online store serves stale features; model predictions based on outdated data
Selective materialization failure	Cause materialization to fail for specific features	Some features stale, others current; model receives inconsistent feature snapshots
Offline-only modification	Modify offline store, let materialization propagate	Poison flows from offline to online via normal sync; appears as legitimate data update
Online-only modification	Modify online store after materialization	Poison affects serving until next materialization overwrites it; creates a time window for targeted attacks
Dual poisoning	Modify both stores consistently	Most complete but requires access to both stores; evades consistency monitoring
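Most of the patterns in the table leave the two stores disagreeing, which a consistency check can surface. The following is a minimal sketch that compares two snapshots of a single feature, each represented as a plain {entity: value} dictionary. The function name, tolerances, and sample values are illustrative assumptions.

```python
import math

def find_skew(online: dict, offline: dict,
              rel_tol: float = 1e-6, abs_tol: float = 1e-9) -> dict:
    """Compare two {entity: value} snapshots and describe every mismatch."""
    mismatches = {}
    for entity in online.keys() | offline.keys():
        on, off = online.get(entity), offline.get(entity)
        if on is None or off is None:
            # Present in one store only: stale, deleted, or never materialized.
            mismatches[entity] = ("missing", on, off)
        elif not math.isclose(on, off, rel_tol=rel_tol, abs_tol=abs_tol):
            mismatches[entity] = ("diverged", on, off)
    return mismatches

offline = {"user_1": 0.42, "user_2": 0.10, "user_3": 0.77}
online = {"user_1": 0.42, "user_2": 0.95}
report = find_skew(online, offline)
```

Here `user_2` is reported as diverged and `user_3` as missing from the online store. A check of this kind has nothing to report when both stores are modified consistently, which matches the dual poisoning row above.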

Exploiting Consistency Checks

Some feature stores monitor consistency between online and offline stores. An attacker can exploit these checks:

False positive flooding. Introduce many small, harmless inconsistencies to trigger consistency alerts. When the team investigates and finds false positives, they may relax monitoring thresholds or ignore alerts, creating cover for the real poisoning.

Monitoring blind spots. Identify which features are monitored for consistency and target unmonitored features. Feature stores with thousands of features rarely monitor every one.

Evasion Techniques

Distribution-Preserving Poisoning

Naive feature poisoning shifts the statistical distribution of feature values, making it detectable by distribution monitoring. Sophisticated poisoning preserves aggregate statistics while targeting specific entities.

Mean-preserving poisoning. Increase the feature value for the target entity and decrease it for a non-critical entity by the same amount. The mean remains unchanged.

Variance-preserving poisoning. Modify values within the existing distribution's range so that statistical tests (KS test, chi-square) do not detect a shift.

Gradual poisoning. Modify feature values incrementally over time, staying within natural drift expectations. Each modification is too small to trigger alerts, but the cumulative effect is significant.

import numpy as np

def preserve_distribution_poison(
    feature_values: dict,
    target_entity: str,
    target_value: float,
    non_critical_entities: list
):
    """
    Modify the target entity's feature while preserving the mean
    of the feature across all entities. Other statistics (variance,
    quantiles) are not preserved.
    """
    if not non_critical_entities:
        raise ValueError("need at least one non-critical entity to absorb the change")
    if target_entity in non_critical_entities:
        raise ValueError("target entity cannot also be a compensating entity")

    original_value = feature_values[target_entity]
    delta = target_value - original_value

    # Distribute the opposite delta across non-critical entities
    # to preserve the mean
    compensation_per_entity = -delta / len(non_critical_entities)

    poisoned = feature_values.copy()
    poisoned[target_entity] = target_value

    for entity in non_critical_entities:
        poisoned[entity] += compensation_per_entity

    # Verify the mean is preserved
    original_mean = np.mean(list(feature_values.values()))
    poisoned_mean = np.mean(list(poisoned.values()))
    assert abs(original_mean - poisoned_mean) < 1e-6

    return poisoned
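From the defender's side, the same arithmetic shows why an aggregate monitor is not enough and what a per-entity comparison adds. The sketch below is a small, self-contained illustration using only the standard library. The snapshot values and the threshold of 20 are made up for the example.

```python
from statistics import mean

def largest_entity_changes(before: dict, after: dict, threshold: float) -> dict:
    """Return {entity: delta} for entities whose value moved by more than threshold."""
    return {
        entity: after[entity] - before[entity]
        for entity in before
        if entity in after and abs(after[entity] - before[entity]) > threshold
    }

before = {"a": 50.0, "b": 50.0, "c": 50.0, "d": 50.0, "e": 50.0}
# Mean-preserving change: "a" rises by 40, the other four each drop by 10.
after = {"a": 90.0, "b": 40.0, "c": 40.0, "d": 40.0, "e": 40.0}

# A monitor that only tracks the feature mean sees no change at all.
mean_shift = mean(after.values()) - mean(before.values())
# A per-entity diff against the previous snapshot isolates the target.
flagged = largest_entity_changes(before, after, threshold=20.0)
```

The per-entity diff flags only `a`, while the mean shift is zero. The cost is that per-entity checks need a trusted earlier snapshot and a threshold tuned to each feature's normal volatility.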

Timing-Based Evasion

Pre-monitoring windows. Execute poisoning before monitoring starts (e.g., outside the business hours in which alerts are reviewed).

Post-materialization windows. Poison the online store immediately after materialization. The next materialization will overwrite the poison, so the window is limited but may be sufficient for targeted exploitation.

Maintenance windows. Exploit scheduled maintenance periods when monitoring may be reduced.

Impact Assessment

Severity by Domain

Domain	Feature Poisoning Impact
Financial services	Fraudulent transaction approval, credit score manipulation
Healthcare	Incorrect risk assessments, treatment recommendations
Advertising	Budget misallocation, competitor ad suppression
Content recommendation	Amplification of specific content, suppression of others
Autonomous systems	Sensor data manipulation, incorrect environmental assessment

Blast Radius

The blast radius of feature poisoning depends on how many models consume the poisoned feature:

A feature used by one model affects one prediction surface
A shared feature used by ten models affects all ten
Feature stores are designed for sharing, so shared features are the norm
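Blast radius can be estimated ahead of time by inverting the mapping from models to the features they read. The sketch below assumes that mapping is available as a plain dictionary, for example exported from a model registry. The model and feature names are hypothetical.

```python
from collections import defaultdict

def blast_radius(model_features: dict) -> dict:
    """Invert {model: [features]} into {feature: sorted list of consuming models}."""
    consumers = defaultdict(set)
    for model, features in model_features.items():
        for feature in features:
            consumers[feature].add(model)
    return {feature: sorted(models) for feature, models in consumers.items()}

# Hypothetical registry contents.
model_features = {
    "fraud_model": ["txn_count_7d", "account_age_days"],
    "credit_model": ["account_age_days", "income"],
    "churn_model": ["account_age_days"],
}
radius = blast_radius(model_features)
```

In this example `account_age_days` feeds all three models, so it would be a natural candidate for the tightest write controls and monitoring.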

Detection and Prevention

Detection Signals

Signal	What It Detects	Limitation
Distribution monitoring	Broad distribution shifts	Misses targeted entity-level poisoning
Entity-level anomaly detection	Individual entity feature changes	High false positive rate for volatile features
Consistency monitoring	Online/offline divergence	Misses dual-store poisoning
Access logging	Unauthorized write access	Does not detect authorized-but-malicious access
Value range checks	Out-of-bounds values	Misses in-range but incorrect values
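The limitations column suggests combining signals rather than relying on one. As one example, gradual poisoning stays under day-over-day thresholds, but comparing each snapshot against a pinned baseline exposes the accumulated change. The sketch below illustrates the difference for a single entity's daily values. The function name, thresholds, and data are assumptions chosen for the example.

```python
def drift_alerts(daily_values: list, step_threshold: float,
                 baseline_threshold: float) -> dict:
    """Compare each day to the previous day and to the first (baseline) day."""
    baseline = daily_values[0]
    step_alerts, baseline_alerts = [], []
    for day in range(1, len(daily_values)):
        if abs(daily_values[day] - daily_values[day - 1]) > step_threshold:
            step_alerts.append(day)
        if abs(daily_values[day] - baseline) > baseline_threshold:
            baseline_alerts.append(day)
    return {"step": step_alerts, "baseline": baseline_alerts}

# The value creeps up by 2 per day, never exceeding the step threshold of 5.
values = [100.0, 102.0, 104.0, 106.0, 108.0, 110.0]
alerts = drift_alerts(values, step_threshold=5.0, baseline_threshold=7.0)
```

The step check never fires, while the baseline check fires on days 4 and 5. A pinned baseline has to be refreshed deliberately, since features that legitimately trend over time would otherwise alert forever.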

Prevention Controls

Immutable offline store. Use append-only storage for historical features; prevent modification of existing records.
Write access restriction. Minimize the number of service accounts and users with write access to online and offline stores.
Feature value signing. Sign feature values at computation time and verify at serving time.
Real-time anomaly detection. Monitor feature values at serving time for entity-level anomalies.
Materialization integrity. Verify that materialized values match offline store values.
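Feature value signing can be sketched with an HMAC computed by the feature pipeline and checked by the serving layer. This is a minimal illustration using Python's standard `hmac` module, not a complete design: the message layout, function names, and demo key are assumptions, and a real deployment would need key management and probably a timestamp in the signed message to prevent replay of old values.

```python
import hashlib
import hmac
import struct

def sign_feature(key: bytes, entity: str, feature: str, value: float) -> bytes:
    """Return an HMAC-SHA256 tag binding the value to its entity and feature name."""
    # Simplistic encoding; assumes "|" does not appear in entity or feature names.
    message = b"|".join([entity.encode(), feature.encode(), struct.pack("<d", value)])
    return hmac.new(key, message, hashlib.sha256).digest()

def verify_feature(key: bytes, entity: str, feature: str,
                   value: float, tag: bytes) -> bool:
    """Recompute the tag and compare in constant time."""
    expected = sign_feature(key, entity, feature, value)
    return hmac.compare_digest(expected, tag)

key = b"demo-key-not-for-production"
tag = sign_feature(key, "user_1", "credit_score", 640.0)
```

With this in place, a value changed directly in the online store no longer verifies unless the attacker also holds the signing key, and a tag copied from another entity fails because the entity name is part of the signed message.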


Knowledge Check

An attacker poisons feature values in the offline store with backdated timestamps that fall within the training window. Current feature values remain clean. A daily distribution monitoring job checks online store features. Will this attack be detected?
