ESMC-600M-2024-12
EvolutionaryScale
Introduction
ESMC-600M-2024-12 is part of the ESM Cambrian model family, which is designed to produce representations of the underlying biology of proteins. The family improves substantially on previous versions by scaling up training data and compute.
Architecture
ESM Cambrian models, including ESMC-600M-2024-12, form a model family parallel to the ESM3 generative models. Rather than generating proteins, they focus on representing protein biology. The family scales up to 6 billion parameters (this checkpoint is the 600-million-parameter member) and achieves state-of-the-art performance in protein language modeling. Scaling yields substantial improvements in both inference time and performance, to the point that ESM Cambrian models surpass larger models from earlier generations.
Training
ESM Cambrian models are trained with more data and more compute than earlier models such as ESM2. The result is a more efficient model that delivers high-quality representations of proteins.
Guide: Running Locally
- Installation: To use the ESMC-600M-2024-12 model, install the esm package with pip: pip install esm
- Repository: For further information on using the model, refer to the README and notebooks in the ESM GitHub repository.
- Hardware Suggestions: Running this model efficiently may require significant computational resources. Cloud GPUs, such as those offered by AWS, Google Cloud, or Azure, are recommended for best performance.
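The steps above can be sketched in Python as follows. This is a minimal illustration, not an official recipe: the import paths, the "esmc_600m" checkpoint name, and the encode/logits calls follow the usage shown in the ESM GitHub README at the time of writing and may change between releases. The clean_sequence helper and the embed_sequence wrapper are hypothetical names introduced here for illustration; the residue check is stricter than the library itself requires.

```python
from typing import Optional

# Assumption: only the 20 standard amino-acid letters are accepted by this
# illustrative helper; the esm tokenizer itself may accept more symbols.
VALID_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")


def clean_sequence(raw: str) -> str:
    """Hypothetical helper: remove whitespace, upper-case, reject other letters."""
    seq = "".join(raw.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    unknown = sorted(set(seq) - VALID_RESIDUES)
    if unknown:
        raise ValueError(f"non-standard residue letters: {unknown}")
    return seq


def embed_sequence(raw: str, model_name: str = "esmc_600m",
                   device: Optional[str] = None):
    """Hypothetical wrapper: return embeddings for one protein sequence."""
    # Imported lazily so clean_sequence works without esm or torch installed.
    import torch
    from esm.models.esmc import ESMC
    from esm.sdk.api import ESMProtein, LogitsConfig

    # Prefer a GPU when one is visible, otherwise fall back to the CPU.
    if device is None:
        device = "cuda" if torch.cuda.is_available() else "cpu"

    # Downloads the weights on first use (assumed checkpoint name).
    model = ESMC.from_pretrained(model_name).to(device)

    protein = ESMProtein(sequence=clean_sequence(raw))
    protein_tensor = model.encode(protein)
    output = model.logits(
        protein_tensor, LogitsConfig(sequence=True, return_embeddings=True)
    )
    return output.embeddings
```

Calling embed_sequence with a protein sequence string loads the model and returns its embeddings; passing device="cpu" forces CPU execution, which works but is likely to be slow for a model of this size.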
License
The ESMC-600M-2024-12 model is distributed under a custom non-commercial license. See the license text that accompanies the model for details.