
distilbert-base-multilingual-cased-sentiments-student


distilbert-base-multilingual-cased-sentiments-student is distilled from a zero-shot classification pipeline applied to the Multilingual Sentiment dataset. It was created by lxyuan through knowledge distillation, in which a larger "teacher" model (here, MoritzLaurer/mDeBERTa-v3-base-mnli-xnli) is used to train a smaller "student" model (here, distilbert-base-multilingual-cased). This lets the student approach the teacher's performance while being smaller and faster.

The model classifies the sentiment of multilingual text as positive, neutral, or negative. It handles text in English, Malay, Japanese, and potentially other languages, which makes it useful for applications that need sentiment analysis across multiple languages without language-specific training data.

Model Inputs and Outputs

Inputs

- **Text**: A piece of text, in any of the supported languages (English, Malay, Japanese, etc.), to be classified for sentiment.

Outputs

- **Sentiment scores**: A list of three dictionaries, one per sentiment class, each containing:
  - label: the sentiment label ('positive', 'neutral', or 'negative')
  - score: the probability of that label

Capabilities

The distilbert-base-multilingual-cased-sentiments-student model performs sentiment classification on multilingual text. For example:

```python
from transformers import pipeline

distilled_student_sentiment_classifier = pipeline(
    model="lxyuan/distilbert-base-multilingual-cased-sentiments-student",
    return_all_scores=True,  # deprecated in newer transformers; top_k=None is the equivalent
)

# English
distilled_student_sentiment_classifier(
    "I love this movie and i would watch it again and again!"
)
# [[{'label': 'positive', 'score': 0.9731044769287109},
#   {'label': 'neutral', 'score': 0.016910076141357422},
#   {'label': 'negative', 'score': 0.009985478594899178}]]

# Malay
distilled_student_sentiment_classifier(
    "Saya suka filem ini dan saya akan menontonnya lagi dan lagi!"
)
# [[{'label': 'positive', 'score': 0.9760093688964844},
#   {'label': 'neutral', 'score': 0.01804516464471817},
#   {'label': 'negative', 'score': 0.005945465061813593}]]

# Japanese
distilled_student_sentiment_classifier("私はこの映画が大好きで、何度も見ます！")
# [[{'label': 'positive', 'score': 0.9342429041862488},
#   {'label': 'neutral', 'score': 0.040193185210227966},
#   {'label': 'negative', 'score': 0.025563929229974747}]]
```

What Can I Use It For?

The model suits a variety of applications that require multilingual sentiment analysis, such as:

- **Social media monitoring**: analyzing customer sentiment across multiple languages on social media platforms.
- **Product reviews**: aggregating and analyzing product reviews from customers in different countries and languages.
- **Market research**: gauging public opinion on topics or events in a global context.
- **Customer service**: automatically detecting the sentiment of customer inquiries or feedback in different languages.

Because the model is distilled and lightweight, the resulting sentiment analysis pipelines are fast, scalable, and able to handle text in multiple languages.

Things to Try

One interesting aspect of this model is that it was trained by knowledge distillation: a larger "teacher" model produced soft sentiment labels that a smaller "student" model learned to reproduce, giving the student high performance at a fraction of the size. A sketch of this idea follows.
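To make the distillation idea concrete, here is a minimal sketch of a single distillation step: the zero-shot teacher scores the three sentiment labels, and the student is trained to match that distribution with a KL-divergence loss. The model names are the real ones mentioned above, but the training loop (including the `distillation_step` helper, batch size, and learning rate) is a simplified illustration, not the author's actual recipe.

```python
import torch
import torch.nn.functional as F
from transformers import AutoTokenizer, AutoModelForSequenceClassification, pipeline

# Teacher: a zero-shot NLI model that scores each candidate sentiment label.
teacher = pipeline(
    "zero-shot-classification",
    model="MoritzLaurer/mDeBERTa-v3-base-mnli-xnli",
)
labels = ["positive", "neutral", "negative"]

# Student: a small multilingual classifier with one logit per sentiment label.
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-multilingual-cased")
student = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-multilingual-cased", num_labels=len(labels)
)
optimizer = torch.optim.AdamW(student.parameters(), lr=2e-5)  # assumed hyperparameters

def distillation_step(texts):
    """Hypothetical helper: one simplified step where the student mimics the teacher."""
    # Teacher soft targets: per-text probabilities over the sentiment labels.
    outputs = teacher(texts, candidate_labels=labels)
    if isinstance(outputs, dict):  # the pipeline returns a bare dict for a single text
        outputs = [outputs]
    targets = torch.tensor(
        [[dict(zip(o["labels"], o["scores"]))[l] for l in labels] for o in outputs]
    )

    # Student predictions on the same texts.
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    logits = student(**batch).logits

    # KL divergence between student log-probabilities and teacher probabilities.
    loss = F.kl_div(F.log_softmax(logits, dim=-1), targets, reduction="batchmean")
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

print(distillation_step(["I love this movie!", "This was a waste of time."]))
```

Repeated over the Multilingual Sentiment corpus, steps like this transfer the teacher's zero-shot judgments into the much smaller student.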
You could compare the student's performance against the original teacher model, MoritzLaurer/mDeBERTa-v3-base-mnli-xnli, to see how much the distillation process affected accuracy and speed (see the first sketch below). You could also use this model as a starting point for further fine-tuning on domain-specific sentiment analysis tasks, potentially yielding even better performance for your particular use case (a second sketch follows).
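As a starting point for the comparison, here is a rough sketch that times both pipelines on the same inputs and measures how often their top labels agree. The example texts are placeholders; for a real evaluation you would substitute a labeled test set and measure accuracy against the gold labels.

```python
import time
from transformers import pipeline

# Student: a plain text-classification pipeline.
student = pipeline(
    model="lxyuan/distilbert-base-multilingual-cased-sentiments-student"
)
# Teacher: the original zero-shot classification pipeline.
teacher = pipeline(
    "zero-shot-classification",
    model="MoritzLaurer/mDeBERTa-v3-base-mnli-xnli",
)
labels = ["positive", "neutral", "negative"]

# Placeholder evaluation texts; swap in your own data for a meaningful test.
texts = [
    "I love this movie and I would watch it again and again!",
    "The plot was predictable and the acting was flat.",
]

start = time.perf_counter()
student_preds = [out["label"] for out in student(texts)]
student_time = time.perf_counter() - start

start = time.perf_counter()
teacher_preds = [out["labels"][0] for out in teacher(texts, candidate_labels=labels)]
teacher_time = time.perf_counter() - start

agreement = sum(s == t for s, t in zip(student_preds, teacher_preds)) / len(texts)
print(f"student: {student_time:.2f}s  teacher: {teacher_time:.2f}s  agreement: {agreement:.0%}")
```

On typical hardware the distilled student should run noticeably faster than the mDeBERTa teacher, since it has far fewer parameters and skips the per-label NLI scoring.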
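And here is a minimal sketch of further fine-tuning with the standard transformers Trainer. The CSV file name, column layout, and hyperparameters are placeholder assumptions: any dataset with a "text" column and an integer "label" column matching the model's three classes would work.

```python
from datasets import load_dataset
from transformers import (
    AutoTokenizer,
    AutoModelForSequenceClassification,
    Trainer,
    TrainingArguments,
)

model_name = "lxyuan/distilbert-base-multilingual-cased-sentiments-student"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

# Placeholder file: a CSV with "text" and "label" columns, where labels are
# integers aligned with the model's positive/neutral/negative class ids.
dataset = load_dataset("csv", data_files="my_domain_reviews.csv")["train"]

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=128)

dataset = dataset.map(tokenize, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="finetuned-sentiment", num_train_epochs=1),
    train_dataset=dataset,
)
trainer.train()
```

Because the student already encodes general multilingual sentiment knowledge, even a small amount of in-domain data may noticeably sharpen its predictions on your texts.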


Updated 5/28/2024