Anthonny Olime
Aviv-anthonnyolime
AI & ML interests
None yet
Recent Activity
liked a model 8 days ago: BAAI/Emu3.5
liked a model 11 days ago: meituan-longcat/LongCat-Video
liked a model 11 days ago: MiniMaxAI/MiniMax-M2
Organizations
Dataset
Paper - Multimodal
Papers related to multimodal models. Research for a: Modular, Multimodal, Multi-Stream, Mixture of Experts, Universal Transformer, Matryoshka embedding
- Flowing from Words to Pixels: A Framework for Cross-Modality Evolution
Paper • 2412.15213 • Published • 28
- No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper • 2412.11768 • Published • 43
- Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper • 2412.13663 • Published • 157
- Autoregressive Video Generation without Vector Quantization
Paper • 2412.14169 • Published • 14
Text-to-image
Audio model
Papers
- Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance
Paper • 2502.08127 • Published • 58
- Vision Transformers Need Registers
Paper • 2309.16588 • Published • 83
- HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation
Paper • 2504.13072 • Published • 13
- What are you sinking? A geometric approach on attention sink
Paper • 2508.02546 • Published • 1
Model - Misc
Audio Dataset
Omni-model
3D - House amenities model