aria-amt

Efficient and robust implementation of seq-to-seq automatic piano transcription.

Install

Requires Python 3.11

git clone https://github.com/EleutherAI/aria-amt.git
cd aria-amt
pip install -e .

Download the preliminary model weights:

Piano (v1)

wget https://storage.googleapis.com/aria-checkpoints/amt/piano-medium-stacked-1.0.safetensors

Usage

You can download mp3s from youtube using yt-dlp:

yt-dlp --audio-format mp3 --extract-audio --no-playlist --audio-quality 0 <youtube-link> -o <save-path>

You can then transcribe using the cli:

aria-amt transcribe \
    medium-stacked \
    <path-to-checkpoint> \
    -load_path <path-to-audio> \
    -save_dir <path-to-save-dir> \
    -bs 1 \
    -compile \
    -q8

If you want to do batch transcription, use the -load_dir flag and adjust -bs accordingly. Compiling and may take some time, but provides a significant speedup. Quantizing (-q8 flag) further speeds up inference when the -compile flag is also used.

NOTE: Int8 quantization is only supported on GPUs that support BF16.

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
.github/workflows		.github/workflows
amt		amt
config		config
scripts		scripts
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

amt

amt

config

config

scripts

scripts

tests

tests

.gitignore

.gitignore

LICENSE

LICENSE

MANIFEST.in

MANIFEST.in

Makefile

Makefile

README.md

README.md

requirements-dev.txt

requirements-dev.txt

requirements.txt

requirements.txt

setup.py

setup.py

Repository files navigation

aria-amt

Install

Usage

About

Releases

Packages

Contributors 2

Languages

License

EleutherAI/aria-amt

Folders and files

Latest commit

History

Repository files navigation

aria-amt

Install

Usage

About

Resources

License

Stars

Watchers

Forks

Languages