Scalable Attention Mechanisms for Real-Time Sensor Fusion in Autonomous Industrial Robots
We present SARTI, a transformer-based architecture fusing heterogeneous sensor streams — LiDAR, tactile, and infrared — at sub-10ms latency. Validated across seven industrial environments, achieving 97.3% object classification accuracy under heavy occlusion — a 14-point improvement over all existing baselines. Cross-modal positional encoding is the dominant contributor to performance gains.
Show more