rubert base cased sentiment new LLM Model

Introduction

The rubert-base-cased-sentiment-new is a text classification model designed for sentiment analysis of Russian texts. Developed by Tatyana Voloshina, it is based on the BERT architecture, focusing on identifying sentiment in Russian language inputs.

Architecture

The model is derived from the BERT architecture, specifically tailored for sentiment analysis. It categorizes input text into three sentiment labels: neutral, positive, and negative. The model is trained using the Tatyana/ru_sentiment_dataset.

Training

Training Data

The model utilizes the Tatyana/ru_sentiment_dataset for training, which contains labeled Russian text data for sentiment analysis.

Training Procedure

Details on preprocessing, speeds, sizes, and times are not provided.

Guide: Running Locally

To run the rubert-base-cased-sentiment-new model locally, follow these steps:

Install Required Packages

!pip install tensorflow-gpu
!pip install deeppavlov
!pip install fasttext
!pip install transformers

Set Up the Model Download the necessary PyTorch-trained model files from the provided Google Drive folder and place model.pth.tar in the same folder as the other model files.

Install Additional Components

!python -m deeppavlov install squad_bert
!python -m deeppavlov install bert_sentence_embedder

Load and Use the Model

from deeppavlov import build_model

model = build_model('path_to_model/rubert_sentiment.json')
model(["Сегодня хорошая погода", "Я счастлив проводить с тобою время", "Мне нравится эта музыкальная композиция"])

Suggested Cloud GPUs

For improved performance, consider using cloud GPUs from providers like AWS, Google Cloud, or Azure.

License

The specific licensing details for the rubert-base-cased-sentiment-new model are not provided. Users are advised to consult the model repository or contact the authors for more information.

More Related APIs in Text Classification