Downloading the model

Access HuggingFace Hub

Visit the HuggingFace hub.
Use the search bar to look for "Llama v3" models.

Identify the Correct Models

Click on the Meta Llama3 collection

We are downloading Meta-Llama-3-8B.

If you have not done already, you will have to get permission to download the Llama3 models. It is a simple process.

We note that this model has not yet been converted to Huggingface weights.

Models with 'hf' in their name are already converted to HuggingFace checkpoints. This means they are ready to use and require no further conversion.

Download the model into your model directory

First, we must connect to Huggingface.

If you have not done already, create a repository to store a Huggingface token

git config --global credential.helper store

Then connect to Huggingface

huggingface-cli login

When asked to enter the authentication code, use:

Your Huggingface token....

What is a Huggingface User Token

User Access Tokens on the Hugging Face platform serve as a secure method for authenticating applications and notebooks to access Hugging Face services.

Purpose of User Access Tokens

Preferred Authentication Method: They are the recommended way to authenticate an application or notebook to Hugging Face services.
Management: Managed through the user's settings on the Hugging Face platform.

Scope and Roles

Read Role: Tokens with this role provide read access to both public and private repositories that the user or their organization owns. They are suitable for tasks like downloading private models or performing inference.
Write Role: In addition to read privileges, tokens with the write role can modify content in repositories where the user has write access. This includes creating or updating repository content, such as training or modifying a model card.

Managing User Access Tokens

Creation: Access tokens are created in the user's settings under the Access Tokens tab.
Customization: Users can select a role and name for each token.
Control: Tokens can be deleted or refreshed for security purposes.

Usage of User Access Tokens

Versatility: Can be used in various ways, including:
- As a replacement for a password for Git operations or basic authentication.
- As a bearer token for calling the Inference API.
- Within Hugging Face Python libraries, like transformers or datasets, by passing the token for accessing private models or datasets.
Security Warning: Users are cautioned to safeguard their tokens to prevent unauthorized access to their private repositories.

Best Practices

Separate Tokens for Different Uses: Create distinct tokens for different applications or contexts (e.g., local machine, Colab notebook, custom servers).
Role Appropriateness: Assign only the necessary role to each token. If only read access is needed, limit the token to the read role.
Token Management: Regularly rotate and manage tokens to ensure security, especially if a token is suspected to be compromised.

When asked whether you want to add your Huggingface token as a git token credential, say yes. This should be the output:

Token is valid (permission: read).
Your token has been saved in your configured git credential helpers (store).
Your token has been saved to /home/paperspace/.cache/huggingface/token
Login successful

The install git large file storage so we can download the model (which is large)

git lfs install

If successful, the following output will be displayed in your terminal

Updated git hooks.
Git LFS initialized.

What is Git large file storage

Git Large File Storage (LFS) is an extension for Git that addresses issues related to handling large files and binary data in Git repositories.

Why Git LFS is Necessary

Limitations with Large Files in Git:
- Standard Git is excellent for handling text files (like code), which are typically small and benefit from Git's delta compression (storing only changes between versions). However, Git isn't optimized for large binary files (like images, videos, datasets, etc.). These files can dramatically increase the size of the repository and degrade performance.
Efficiency and Performance:
- Cloning and pulling from a repository with large files can be slow, consume a lot of bandwidth, and require significant storage space on every developer's machine. This inefficiency can hinder collaboration and development speed.
Repository Bloat:
- In standard Git, every version of every file is stored in the repository's history. This means that even if a large file is deleted from the current working directory, its history still resides in the Git database, leading to a bloated repository.

How Git LFS Works

Pointer Files:
- Git LFS replaces large files in your repository with tiny pointer files. When you commit a large file, Git LFS stores a reference (pointer) to that file in your repository, while the actual file data is stored in a separate server-side LFS storage.
LFS Storage:
- The large file contents are stored on a remote server configured for LFS, typically alongside your Git repository hosting service (like GitHub, GitLab, Bitbucket, etc.). This storage is separate from your main Git repository storage.
Version Tracking:
- Git LFS tracks versions of the large files separately. Every time you push or pull changes, Git LFS uploads or downloads the correct version of the large file from the LFS server, ensuring that you have the correct files in your working copy.
Selective Download:
- When cloning or pulling a repository, Git LFS downloads only the versions of large files needed for your current commit, reducing the time and bandwidth compared to downloading the entire history of every file.
Compatibility:
- Git LFS is compatible with existing Git services and workflows. It's an extension to Git, so you use the same Git commands. Repositories using Git LFS are still standard Git repos, ensuring backward compatibility.

Create a models directory

mkdir models

Move into the Axolotl models directory:

cd models

Enter the git clone command:

git clone https://huggingface.co/meta-llama/Meta-Llama-3-8B

The outcome should be new folder called LLama3-8b with all the appropriate files in it.

This is a screen cut from VS Code, showing all of the model files.

PreviousLlama3 NextAnalysis of model files

Last updated 1 year ago

Was this helpful?