17 Mar Google researchers introduce Multimodal Bottleneck Transformer for audiovisual fusion Kartik Wali attention bottlenecks for multimodal fusion Machine perception models are usually modality-specific and optimised for unimodal benchmarks.