TokenButler -- Predict token importance for every head in the transformer from the first layer alone. Enable fine-grained token sparsity!
YASH AKHAURI
akhauriyash
AI & ML interests
None yet
Recent Activity
New activity (about 1 month ago)
akhauriyash/Code-Regression: Improve dataset card: Add paper, code, project links, task category, and sample usage
Updated a dataset (about 1 month ago)
akhauriyash/Code-Regression
Updated a dataset (about 1 month ago)
akhauriyash/GraphArch-Regression
Organizations
None yet