Intelligent Machine Learning (IML) targets to setup a full-stack, high-performant and intelligent infrastructure of deep learning for both offline and online, including data processing, model training, model evaluation, and model inferencing, and makes DL real engineering-free and democratic for AI-driven biz.
IML
Pinned
Repositories
Showing 7 of 7 repositories
- flash-attention Public Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention