Nemo readme revisions #9129

jgerh · 2024-05-07T18:42:14Z

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Collection: [Note which collection this PR will affect]

Changelog

Add specific line by line info of high level changes in this PR.

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

New Feature
Bugfix
Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

Related to # (issue)

README.rst

Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: jgerh <163925524+jgerh@users.noreply.github.com>

titu1994

Overall text changes are nice, but it leaves many ambiguities, with respect to what features are available for each domain, so please correct those.

Separately, during a recent conference, I have had comments from researchers saying they could not find the ASR models and features supported in NeMo after 1.23 - when the previous refactor pushed all the domain docs up inside of the nemo repo - and left them completely invisible to the world.

Most people will not click links nested in a wall of text to hunt down domain features and docs. So i request that the domain docs be added to the end of the NeMo main readme

titu1994 · 2024-05-15T17:26:55Z

README.rst

-and text-to-speech synthesis (TTS).
-The primary objective of NeMo is to provide a scalable framework for researchers and developers from industry and academia
-to more easily implement and design new generative AI models by being able to leverage existing code and pretrained models.
+NVIDIA NeMo Framework is a scalable and cloud-native generative AI framework built for researchers and PyTorch developers working on `Large Language Models <nemo/collections/nlp/README.md>`_ (LLMs), `Multimodal Models <nemo/collections/multimodal/README.md>`_ (MMs), `Automatic Speech Recognition <nemo/collections/asr/README.md>`_ (ASR), `Text to Speech <nemo/collections/tts/README.md>`_ (TTS), and `Computer Vision <nemo/collections/vision/README.md>`_ (CV). It is designed to help you efficiently create, customize, and deploy new generative AI models by leveraging existing code and pre-trained model checkpoints.


Please pull out the domain specific readmes at the end of the main readme. We have had questions about ASR features and models we support at ICASSP this year already due to the hiding of the ASR domain features inside if nemo/collections/asr/README.md.

Revert back the Key Features link. In addition, this issue will be addressed on a separate PR.

titu1994 · 2024-05-15T17:27:32Z

README.rst


-For technical documentation, please see the `NeMo Framework User Guide <https://docs.nvidia.com/nemo-framework/user-guide/latest/playbooks/index.html>`_.


Please add back the link to the documentation at the very top - it is impossible to find documentation in a wall of text somewhere in the middle.

Create a documentation heading and add user guide link here.

titu1994 · 2024-05-15T17:29:12Z

README.rst


-When applicable, NeMo models take advantage of the latest possible distributed training techniques,
-including parallelism strategies such as
+When applicable, NeMo models leverage cutting-edge distributed training techniques, incorporating `parallelism strategies <https://docs.nvidia.com/nemo-framework/user-guide/latest/modeloverview.html>`_ to enable efficient training of very large models. These techniques include Tensor Parallelism (TP), Pipeline Parallelism (PP), Fully Sharded Data Parallelism (FSDP), Mixture-of-Experts (MoE), and Mixed Precision Training with BFloat16 and FP8, as well as others.


Be explicit - only NeMo LLM and Multimodal Models can leverage parallel strategies like above

titu1994 · 2024-05-15T17:29:49Z

README.rst


-For technical documentation, please see the `NeMo Framework User Guide <https://docs.nvidia.com/nemo-framework/user-guide/latest/playbooks/index.html>`_.
+Model Training, Alignment, and Customization


Be explicit - LLM Model Training, ALignment and Customization

titu1994 · 2024-05-15T17:31:07Z

README.rst


-NeMo LLMs can be aligned with state of the art methods such as SteerLM, DPO and Reinforcement Learning from Human Feedback (RLHF),
-see `NVIDIA NeMo Aligner <https://github.com/NVIDIA/NeMo-Aligner>`_ for more details.
+Model Deployment and Optimization


LLM Deployment and Optimization

Change to LLM and MM Model . . .

titu1994 · 2024-05-15T17:44:02Z

README.rst

@@ -408,35 +363,32 @@ To install Apex, run
    git checkout $apex_commit
    pip install . -v --no-build-isolation --disable-pip-version-check --no-cache-dir --config-settings "--build-option=--cpp_ext --cuda_ext --fast_layer_norm --distributed_adam --deprecated_fused_adam --group_norm"

+When attempting to install Apex separately from the NVIDIA PyTorch container, you might encounter an error if the CUDA version on your system is different from the one used to compile PyTorch. To bypass this error, you can comment out the relevant line in the setup file located in the Apex repository on GitHub here: https://github.com/NVIDIA/apex/blob/master/setup.py#L32.


Sidenote @ericharper can we request apex folks to remove this hardcoded check ? We almost always have to uncomment it anyway, its a pain to have to clone the repo and manually edit files to get something to work

titu1994 · 2024-05-15T17:46:31Z

README.rst

-To use a pre-built container, please run
+NeMo containers are launched concurrently with NeMo version updates. For example, the release of NeMo ``r1.23.0`` comes with the container ``nemo:24.01.speech``. The latest containers are:
+
+* NeMo LLM and MM container - `nvcr.io/nvidia/nemo:24.03.framework`


This should be updated to the unified container if its out already

Submit a PR to update the container version. Stet as is.

titu1994 · 2024-05-15T17:47:24Z

README.rst


+Get Help


Revert - it sounds very wrong to say "Get Help" - just keep it as Contributing & Discussion

titu1994 · 2024-05-15T17:48:05Z

README.rst


-We welcome community contributions! Please refer to `CONTRIBUTING.md <https://github.com/NVIDIA/NeMo/blob/stable/CONTRIBUTING.md>`_ for the process.
-
-Publications


Add back publications, we have a new page for research publications

titu1994 · 2024-05-15T17:48:33Z

README.rst


-If you would like to add your own article to the list, you are welcome to do so via a pull request to this repository's ``gh-pages-src`` branch.
-Please refer to the instructions in the `README of that branch <https://github.com/NVIDIA/NeMo/tree/gh-pages-src#readme>`_.
+To contribute an article to the collection, please submit a pull request to the ``gh-pages-src`` branch of this repository. For detailed information, please consult the README located at the `gh-pages-src branch <https://github.com/NVIDIA/NeMo/tree/gh-pages-src#readme>`_.


TODO: @erastorgueva-nv update this part after the PR is refactored and merged

…into nemo-readme-revisions

jgerh added 2 commits May 6, 2024 14:39

REvisions to NeMo ReadMe

1e412ae

NeMo Readme.rst revisions

4d5a284

jgerh self-assigned this May 7, 2024

jgerh requested a review from ericharper May 7, 2024 19:07

jgerh assigned jgerh and unassigned jgerh May 7, 2024

ericharper reviewed May 15, 2024

View reviewed changes

README.rst Outdated Show resolved Hide resolved

Update README.rst

290ae64

Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: jgerh <163925524+jgerh@users.noreply.github.com>

titu1994 requested changes May 15, 2024

View reviewed changes

jgerh and others added 5 commits May 17, 2024 17:55

ReadMe updates

900b9eb

ReadMe Updates

f4cc07a

Merge branch 'NVIDIA:main' into nemo-readme-revisions

9ad284d

Updates to NeMo Readme with new license information

5ead9cb

Merge branch 'nemo-readme-revisions' of https://github.com/jgerh/NeMo …

ec377cb

…into nemo-readme-revisions

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nemo readme revisions #9129

Nemo readme revisions #9129

jgerh commented May 7, 2024

titu1994 left a comment •

edited

titu1994 May 15, 2024

jgerh May 17, 2024

titu1994 May 15, 2024

jgerh May 17, 2024

titu1994 May 15, 2024

titu1994 May 15, 2024

titu1994 May 15, 2024

jgerh May 17, 2024

titu1994 May 15, 2024

titu1994 May 15, 2024

jgerh May 17, 2024

titu1994 May 15, 2024

jgerh May 17, 2024

titu1994 May 15, 2024

jgerh May 17, 2024

titu1994 May 15, 2024


		For technical documentation, please see the `NeMo Framework User Guide <https://docs.nvidia.com/nemo-framework/user-guide/latest/playbooks/index.html>`_.


		For technical documentation, please see the `NeMo Framework User Guide <https://docs.nvidia.com/nemo-framework/user-guide/latest/playbooks/index.html>`_.
		Model Training, Alignment, and Customization


		We welcome community contributions! Please refer to `CONTRIBUTING.md <https://github.com/NVIDIA/NeMo/blob/stable/CONTRIBUTING.md>`_ for the process.

		Publications


		Get Help

Nemo readme revisions #9129

Are you sure you want to change the base?

Nemo readme revisions #9129

Conversation

jgerh commented May 7, 2024

What does this PR do ?

Changelog

Usage

GitHub Actions CI

Before your PR is "Ready for review"

Who can review?

Additional Information

titu1994 left a comment • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

titu1994 left a comment •

edited