Building a Comment Toxicity Ranker Using Hugging Face’s Transformer Models | by Jacky Kaub

[ad_1]

Catching up on NLP and LLM (Half I)

18 min learn

10 hours in the past

As a Information Scientist, I’ve by no means had the chance to correctly discover the most recent progress in Pure Language Processing. With the summer season and the brand new growth of Massive Language Fashions because the starting of the yr, I made a decision it was time to dive deep into the sphere and embark on some mini-projects. In any case, there’s by no means a greater method to be taught than by practising.

As my journey began, I spotted it was sophisticated to seek out content material that takes the reader by the hand and goes, one step at a time, in direction of a deep comprehension of latest NLP fashions with concrete tasks. That is how I made a decision to start out this new sequence of articles.

Constructing a Remark Toxicity Ranker Utilizing HuggingFace’s Transformer Fashions

On this first article, we’re going to take a deep dive into constructing a remark toxicity ranker. This undertaking is impressed by the “Jigsaw Rate Severity of Toxic Comments” competition which occurred on Kaggle final yr.

The target of the competitors was to construct a mannequin with the capability to find out which remark (out of two feedback given as enter) is essentially the most poisonous.

To take action, the mannequin will attribute to each remark handed as enter a rating, which determines its relative toxicity.

What this text will cowl

On this article, we’re going to practice our first NLP Classifier utilizing Pytorch and Hugging Face transformers. I can’t go into the small print of how works transformers, however extra into sensible particulars and implementations and provoke some ideas that can be helpful for the subsequent articles of the sequence.

Specifically, we’ll see:

How one can obtain a mannequin from Hugging Face Hub
How one can customise and use an Encoder
Construct and practice a Pytorch ranker from one of many Hugging Face fashions

This text is straight addressed to information scientists that want to step their sport in NLP from a sensible viewpoint. I can’t do a lot…

[ad_2]

Source link

Building a Comment Toxicity Ranker Using Hugging Face’s Transformer Models | by Jacky Kaub | Aug, 2023

Slamcore releases updated SDK enabling person detection in warehouses and manufacturing

Meet Jupyter AI: A New Open-Source Project that brings Generative Artificial Intelligence to Jupyter Notebooks with Magic Commands and a Chat Interface

Editor

Meet Jupyter AI: A New Open-Source Project that brings Generative Artificial Intelligence to Jupyter Notebooks with Magic Commands and a Chat Interface

Leave a Reply Cancel reply

Browse by Category

Categories

Recommended

Building a Comment Toxicity Ranker Using Hugging Face’s Transformer Models | by Jacky Kaub | Aug, 2023

Catching up on NLP and LLM (Half I)

Constructing a Remark Toxicity Ranker Utilizing HuggingFace’s Transformer Fashions

What this text will cowl

Slamcore releases updated SDK enabling person detection in warehouses and manufacturing

Meet Jupyter AI: A New Open-Source Project that brings Generative Artificial Intelligence to Jupyter Notebooks with Magic Commands and a Chat Interface

Editor

Meet Jupyter AI: A New Open-Source Project that brings Generative Artificial Intelligence to Jupyter Notebooks with Magic Commands and a Chat Interface

Leave a Reply Cancel reply

Browse by Category

Browse by Tags

Categories

Recommended