时序平滑

问题陈述：动作 Chunk 重叠

Action Chunking 策略每次推理输出 n 个动作的序列（通常 n=100）。但推理不是瞬时完成的。模型计算下一个动作 chunk 时，机器人会继续执行上一 chunk 中的动作。这会产生时序重叠问题：

T1: First inference produces [a1, a2, a3, ..., a100]
    Robot begins executing: a1, a2, a3...
    
T2: Inference starts (queue watermark triggered)
    Robot continues: a4, a5, a6...
    
T3: Inference completes after robot executed 30 actions
    New chunk: [b1, b2, b3, ..., b100]
    Remaining old actions: [a31, a32, ..., a100]
    
Problem: How to transition from old plan to new plan?

如果不做平滑，会有两种糟糕选择：

丢弃新 chunk：继续执行过期预测，模型预测会变得无关。
立即替换：从 a30 跳到 b31，造成突兀运动断点。

时序平滑通过混合重叠区间解决此问题。它使用指数加权，兼顾连续性和对新预测的响应能力。

来源： src/action_dispatch/action_dispatch/temporal_smoother.py:44-77, src/action_dispatch/action_dispatch/action_dispatcher_node.py:44-52

算法概览

时序平滑算法分三个阶段运行：

阶段 1：时序对齐图

        graph LR
    subgraph "T1: First_Inference"
        A1["actions1[0:100]<br/>(100 actions)"]
    end
    
    subgraph "T2: Execution_During_Inference"
        E1["executed[0:30]<br/>(30 actions consumed)"]
        R1["remaining[30:100]<br/>(70 actions left)"]
    end
    
    subgraph "T3: Second_Inference"
        A2["actions2[0:100]<br/>(100 new actions)"]
    end
    
    subgraph "T4: Alignment"
        Skip["actions2[0:30]<br/>(skip outdated)"]
        Relevant["actions2[30:100]<br/>(70 relevant new)"]
    end
    
    A1 --> E1
    A1 --> R1
    A2 --> Skip
    A2 --> Relevant
    
    R1 -.->|"overlap with"| Relevant

阶段 2：指数加权混合

        graph TB
    subgraph "Overlap_Region_Processing"
        Old["old_actions[30:100]<br/>(70 old remaining)"]
        New["new_actions[30:100]<br/>(70 new relevant)"]
        
        Old --> Blend["Exponential_Weighted_Blending_Formula"]
        New --> Blend
        
        Blend --> Result["blended[30:100]<br/>(70 smoothed actions)"]
    end
    
    subgraph "Weight_Calculation"
        Count["action_counts[i]<br/>(how many times seen)"]
        Weights["weights = exp(-coeff * k)"]
        Cumsum["cumsum(weights)"]
        
        Count --> Formula["blended[i] = <br/>(old[i] * cumsum[count-1] + <br/>new[i] * weights[count]) <br/>/ cumsum[count]"]
        Weights --> Formula
        Cumsum --> Formula
    end
    
    Formula -.-> Blend

来源： src/action_dispatch/action_dispatch/temporal_smoother.py:145-206, src/action_dispatch/action_dispatch/temporal_smoother.py:208-246

核心组件

时序平滑实现由 src/action_dispatch/action_dispatch/temporal_smoother.py 中的三个主要类组成：

类层级图

        graph TB
    Config["TemporalSmootherConfig<br/>@dataclass"]
    Smoother["TemporalSmoother<br/>Core smoothing logic"]
    Manager["TemporalSmootherManager<br/>Convenience wrapper"]
    
    Config -->|"configures"| Smoother
    Smoother -->|"wrapped by"| Manager
    
    subgraph "TemporalSmootherConfig_Fields"
        F1["enabled: bool = True"]
        F2["chunk_size: int = 100"]
        F3["temporal_ensemble_coeff: float = 0.01"]
        F4["device: Optional[str] = None"]
    end
    
    Config --- F1
    Config --- F2
    Config --- F3
    Config --- F4
    
    subgraph "TemporalSmoother_State"
        S1["_smoothed_actions: torch.Tensor"]
        S2["_action_counts: torch.Tensor"]
        S3["_weights: torch.Tensor"]
        S4["_weights_cumsum: torch.Tensor"]
    end
    
    Smoother --- S1
    Smoother --- S2
    Smoother --- S3
    Smoother --- S4
    
    subgraph "Key_Methods"
        M1["update(new_actions, actions_executed)"]
        M2["get_next_action()"]
        M3["peek_next_action()"]
        M4["reset()"]
    end
    
    Smoother --- M1
    Smoother --- M2
    Smoother --- M3
    Smoother --- M4

来源： src/action_dispatch/action_dispatch/temporal_smoother.py:19-42, src/action_dispatch/action_dispatch/temporal_smoother.py:44-110, src/action_dispatch/action_dispatch/temporal_smoother.py:259-265

TemporalSmootherConfig

定义于 src/action_dispatch/action_dispatch/temporal_smoother.py:19-42 的配置 dataclass。

Field	Type	Default	Description
`enabled`	`bool`	`True`	启用平滑。如果为 `False`，仅执行对齐并透传。
`chunk_size`	`int`	`100`	每个 chunk 的最大动作数。用于权重预计算。
`temporal_ensemble_coeff`	`float`	`0.01`	指数衰减系数。
`device`	`Optional[str]`	`None`	tensor 运算设备（`'cpu'`, `'cuda'`, `'npu:0'`）。为 `None` 时自动检测。

来源： src/action_dispatch/action_dispatch/temporal_smoother.py:19-42

TemporalSmoother

核心平滑实现位于 src/action_dispatch/action_dispatch/temporal_smoother.py:44-257。它维护内部状态：

_smoothed_actions：当前平滑后的动作计划，shape 为 [plan_length, action_dim] src/action_dispatch/action_dispatch/temporal_smoother.py:110。
_action_counts：每个动作被混合的次数，shape 为 [plan_length, 1] src/action_dispatch/action_dispatch/temporal_smoother.py:111。
_weights：预计算指数权重，shape 为 [chunk_size] src/action_dispatch/action_dispatch/temporal_smoother.py:91。
_weights_cumsum：权重累积和，shape 为 [chunk_size] src/action_dispatch/action_dispatch/temporal_smoother.py:92。

关键方法：

Method	Signature	Purpose
`update()`	`update(new_actions, actions_executed_during_inference) -> int`	核心平滑逻辑。对齐并混合新 chunk。 src/action_dispatch/action_dispatch/temporal_smoother.py:145-167
`get_next_action()`	`get_next_action() -> Union[torch.Tensor, np.ndarray]`	从计划中弹出并返回下一个动作。 src/action_dispatch/action_dispatch/temporal_smoother.py:125-143
`peek_next_action()`	`peek_next_action() -> Optional[Union[torch.Tensor, np.ndarray]]`	返回下一个动作但不移除。 src/action_dispatch/action_dispatch/temporal_smoother.py:248-257
`reset()`	`reset()`	清空内部状态，用于新 episode。 src/action_dispatch/action_dispatch/temporal_smoother.py:108-111
`plan_length`	`@property plan_length -> int`	返回当前计划中的动作数。 src/action_dispatch/action_dispatch/temporal_smoother.py:113-118

来源： src/action_dispatch/action_dispatch/temporal_smoother.py:44-257

TemporalSmootherManager

便利封装位于 src/action_dispatch/action_dispatch/temporal_smoother.py:259-322，提供运行时切换和统一接口。所有操作都会委托给内部 TemporalSmoother 实例。

附加方法：

set_enabled(enabled: bool)：运行时开启或关闭平滑 src/action_dispatch/action_dispatch/temporal_smoother.py:318-322。

来源： src/action_dispatch/action_dispatch/temporal_smoother.py:259-322

平滑公式

指数加权混合公式实现于 src/action_dispatch/action_dispatch/temporal_smoother.py:208-246：

# For each action i in the overlap region:
blended[i] = (old[i] * cumsum[count[i]-1] + new[i] * weights[count[i]]) 
             / cumsum[count[i]]

其中：

old[i]：上一平滑计划中的第 i 个动作。
new[i]：新推理结果（对齐后）中的第 i 个动作。
count[i]：动作 i 被混合的次数，从 1 开始。
weights[k] = exp(-temporal_ensemble_coeff * k)。
cumsum[k] 是到索引 k 为止的权重累积和。

权重计算

权重在初始化期间预计算，位置为 src/action_dispatch/action_dispatch/temporal_smoother.py:86-92：

coeff = self.config.temporal_ensemble_coeff
chunk_size = self.config.chunk_size

self._weights = torch.exp(-coeff * torch.arange(chunk_size, dtype=torch.float32))
self._weights_cumsum = torch.cumsum(self._weights, dim=0)

来源： src/action_dispatch/action_dispatch/temporal_smoother.py:86-92, src/action_dispatch/action_dispatch/temporal_smoother.py:208-246

时序对齐过程

update() 方法位于 src/action_dispatch/action_dispatch/temporal_smoother.py:145-206，负责处理时序对齐：

对齐流程图

        flowchart TD
    Start["update(new_actions, actions_executed)"]
    
    ValidateInput["Validate_input_shape<br/>[temporal_smoother.py:168-173]"]
    
    CheckEmpty{"new_actions.shape[0] == 0?"}
    ReturnEarly["Return_current_plan_length"]
    
    ConvertTensor["Convert_to_tensor_on_device<br/>[temporal_smoother.py:175-179]"]
    
    Align["Slice_alignment:<br/>relevant_new = new_actions[actions_executed:]<br/>[temporal_smoother.py:181]"]
    
    CheckState{"Is _smoothed_actions None<br/>or empty?"}
    
    InitializePlan["Initialize_plan:<br/>_smoothed_actions = relevant_new<br/>_action_counts = ones(...)<br/>[temporal_smoother.py:183-189]"]
    
    CheckSmoothing{"config.enabled?"}
    
    ReplaceNoSmooth["Replace_without_smoothing:<br/>_smoothed_actions = relevant_new<br/>[temporal_smoother.py:190-196]"]
    
    ApplySmoothing["_apply_smoothing()<br/>[temporal_smoother.py:198-204]"]
    
    ReturnLength["Return_plan_length"]
    
    Start --> ValidateInput
    ValidateInput --> CheckEmpty
    CheckEmpty -->|"Yes"| ReturnEarly
    CheckEmpty -->|"No"| ConvertTensor
    ConvertTensor --> Align
    Align --> CheckState
    CheckState -->|"Yes"| InitializePlan
    CheckState -->|"No"| CheckSmoothing
    CheckSmoothing -->|"No"| ReplaceNoSmooth
    CheckSmoothing -->|"Yes"| ApplySmoothing
    InitializePlan --> ReturnLength
    ReplaceNoSmooth --> ReturnLength
    ApplySmoothing --> ReturnLength

来源： src/action_dispatch/action_dispatch/temporal_smoother.py:145-206

与 Action Dispatcher 集成

TemporalSmoother 按以下方式集成到 ActionDispatcherNode：

        graph TB
    subgraph "ActionDispatcherNode"
        Client["ActionClient<br/>DispatchInfer"]
        Queue["_queue<br/>collections.deque"]
        Smoother["_smoother<br/>TemporalSmootherManager"]
        Executor["TopicExecutor<br/>Action execution"]
    end
    
    subgraph "Inference_Service"
        InfServer["LeRobotPolicyNode<br/>Action Server"]
    end
    
    subgraph "Control_Loop_State"
        PlanLenStart["_plan_length_at_inference_start"]
        CurrentLen["_smoother.plan_length"]
        Executed["actions_executed =<br/>PlanLenStart - CurrentLen"]
    end
    
    Client -->|"Send goal when<br/>queue < watermark"| InfServer
    InfServer -->|"Return action chunk"| Client
    
    Client -->|"Decode result"| Smoother
    PlanLenStart --> Executed
    Executed -->|"actions_executed"| Smoother
    
    Smoother -->|"Update smoothed plan"| Queue
    Queue -->|"Pop at control_frequency"| Executor
    
    Executor -->|"/joint_commands"| Hardware["ros2_control"]

动作分发器会：

在 _plan_length_at_inference_start 中记录推理开始时的队列长度 src/action_dispatch/action_dispatch/action_dispatcher_node.py:108。
结果到达时，将开始时计划长度和当前长度之差计算为 actions_executed src/action_dispatch/action_dispatch/action_dispatcher_node.py:255-257。
将 (new_chunk, actions_executed) 传给 _smoother.update() src/action_dispatch/action_dispatch/action_dispatcher_node.py:260。
在 _control_loop 中弹出平滑后的动作进行执行 src/action_dispatch/action_dispatch/action_dispatcher_node.py:291。

来源： src/action_dispatch/action_dispatch/action_dispatcher_node.py:43-300, src/action_dispatch/action_dispatch/temporal_smoother.py:145-167

Launch Files 中的配置参数

时序平滑参数可通过 ActionDispatcherNode 中的 ROS 2 parameters 配置：

Parameter	Type	Default	Description
`temporal_smoothing_enabled`	`bool`	`False`	启用或禁用平滑 src/action_dispatch/action_dispatch/action_dispatcher_node.py:73。
`temporal_ensemble_coeff`	`double`	`0.01`	指数衰减系数 src/action_dispatch/action_dispatch/action_dispatcher_node.py:74。
`chunk_size`	`int`	`100`	每个 chunk 的最大动作数 src/action_dispatch/action_dispatch/action_dispatcher_node.py:75。
`smoothing_device`	`string`	`''`	计算设备，空值表示自动检测 src/action_dispatch/action_dispatch/action_dispatcher_node.py:76。

来源： src/action_dispatch/action_dispatch/action_dispatcher_node.py:58-84

测试

完整测试位于 src/action_dispatch/test/test_temporal_smoother.py:1-247。关键测试场景包括：

Test Class	Coverage
`TestTemporalSmootherConfig`	配置验证、参数默认值 src/action_dispatch/test/test_temporal_smoother.py:23-48。
`TestTemporalSmoother`	基本 update/get、禁用平滑、跨帧平滑、tensor 输入、reset src/action_dispatch/test/test_temporal_smoother.py:50-174。
`TestTemporalSmootherManager`	manager 接口、运行时切换、peek 操作 src/action_dispatch/test/test_temporal_smoother.py:176-216。
`TestSmoothingFormula`	权重计算、系数影响、累积和 src/action_dispatch/test/test_temporal_smoother.py:218-247。

来源： src/action_dispatch/test/test_temporal_smoother.py:1-247