Attention is all you need CLIP Decision Transformer Multimodal Learning Gato Decision Transformer CICERO GPT4 LLMs SAM (Segment Anything)