Skip to content

Changelog

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[0.3.0] - 2026-01-30

Added

  • FAST Tokenizer: Frequency-space action tokenizer using DCT + BPE compression, no training required (~28x compression on invariant sequences)
  • OAT-style Tokenizer: Ordered Action Tokenization with Register Encoder, Finite Scalar Quantization (FSQ), and Masked Nested Dropout for anytime prefix decoding
  • FSQ Module: Finite Scalar Quantization with deterministic rounding -- eliminates VQ codebook collapse by construction
  • Register Encoder: Transformer-based temporal compression using learnable register tokens with causal-last attention
  • Nested Dropout: Masked nested dropout module for inducing coarse-to-fine token ordering (Matryoshka-style)
  • OAT Decoder: Cross-attention Transformer decoder for reconstructing sequences from register token latents
  • Benchmark Suite: Three benchmark scripts comparing all 7 tokenization methods on rate-distortion, SE(3) stability, and prefix decoding
  • Interactive Notebook: Comprehensive tutorial notebook (tutorial_action_tokenization.ipynb) with head-to-head comparisons and visualizations
  • Manuscript Updates: Expanded Section 7 with FAST/OAT descriptions, new Appendix with empirical comparison tables

Fixed

  • SE(3) Invariance: Fixed invariant token stability under SE(3) transforms by using trajectory-relative initial frames (init_pose from trajectory start) instead of fixed global axes. This achieves exact invariance (0% token change, MSE = O(1e-13)) versus the previous approximate invariance
  • BPE Token Comparison: Improved token stability reporting to account for different-length BPE sequences and separate pre-BPE (DCT integer) from post-BPE token differences

Changed

  • einops dependency: Added to [tokenization] optional dependencies for FSQ implementation

[0.2.0] - 2026-02-05

Added

  • VLA Integration: Comprehensive LIBERO and RoboCASA support for Vision-Language-Action benchmarks
  • LIBERO-PRO Support: Full integration with LIBERO-PRO perturbation robustness testing
  • Trajectory Adaptation: Fatrop and CasADi solvers for real-time trajectory retargeting (~7ms)
  • Swap Demo: Compelling demonstration showing DHB-XR vs naive replay under spatial perturbations (6.5cm improvement)
  • Simulation Integration: Real-time LIBERO environment execution with OpenCV visualization and video recording
  • VLA Documentation: Extensive documentation explaining how DHB-XR addresses VLA spatial generalization challenges

Changed

  • Repository URLs: Updated all links to point to robodreamer organization
  • Documentation Updates: Added VLA context throughout README and documentation

Fixed

  • Solver Integration: Proper Fatrop/CasADi solver selection in trajectory adaptation functions

[0.1.2] - 2026-02-04

Added

  • MkDocs Documentation: Migrated from Sphinx to MkDocs with Material theme
  • GitHub Actions CI/CD: Automated documentation builds and deployment
  • Encoding Method Enum: Replaced string literals with type-safe EncodingMethod enum
  • GitHub Pages: Documentation hosting setup

Changed

  • Project Metadata: Updated author email and added scikit-learn dependency
  • Copyright Year: Updated to 2026
  • Pixi Configuration: Renamed project workspace

[0.1.1] - 2026-02-03

Added

  • Trajectory Preprocessing: Robustness modules for handling reversals and zero-motion segments
  • DHB-Token: VQ-VAE tokenization pipeline for discrete action representations
  • Generative Models: Variational Flow Matching (VFM) for multi-modal trajectory generation
  • Motion Database: Similarity search and retrieval with DTW support

[0.1.0] - 2026-01-31

Added

  • Core DHB-XR Implementation: DHB-DR (Double-Reflection) and DHB-QR (Quaternion-Relative) invariant encoding
  • Time-Invariant Reparameterization: DHB-TI for speed-independent trajectory representations
  • GPU Acceleration: PyTorch-based batched operations and CusADi for large-scale optimization
  • Fatrop Integration: Fast structured OCP solver for trajectory optimization
  • CasADi Support: General nonlinear optimization for trajectory adaptation
  • Visualization: Enhanced SE(3) trajectory plotting with matplotlib
  • Jupyter Notebooks: Comprehensive demo notebook with interactive examples

Changed

  • Dependencies: Added CUDA support with PyTorch 2.0+ and optional GPU features
  • Documentation: Enhanced README with performance benchmarks and usage examples

[0.0.1] - 2026-01-30

Added

  • Initial Project Structure: Basic DHB encoding/decoding functionality
  • Project Configuration: Pixi environment, GitHub Actions, and development tooling
  • Basic Testing: Initial test suite setup
  • Documentation: Sphinx-based documentation framework