import torch
import torch.nn as nn
import pytorch_lightning as pl
from transformers import AdamW, BertModel, get_linear_schedule_with_warmup
from pytorch_lightning.metrics.functional import accuracy, auroc
# HuggingFace model id of the pretrained Chinese BERT checkpoint used as the encoder.
BERT_MODEL_NAME = 'bert-base-chinese'
class SentimentTagger(pl.LightningModule):
    """Multi-label sentiment classifier: BERT encoder + sigmoid linear head.

    Each of the ``n_classes`` outputs is an independent binary label; the
    model is trained with element-wise binary cross-entropy.

    Args:
        n_classes: number of independent binary labels.
        n_training_steps: total optimizer steps, used by the LR schedule.
        n_warmup_steps: linear warmup steps for the LR schedule.
    """

    def __init__(self, n_classes: int,
                 n_training_steps=None,
                 n_warmup_steps=None):
        super().__init__()
        # NOTE(review): attribute is named `roberta` but the weights loaded are
        # a BERT checkpoint (BERT_MODEL_NAME). Kept as-is so existing saved
        # checkpoints keyed on "roberta.*" still load.
        self.roberta = BertModel.from_pretrained(BERT_MODEL_NAME,
                                                 return_dict=True)
        self.classifier = nn.Linear(self.roberta.config.hidden_size, n_classes)
        self.n_training_steps = n_training_steps
        self.n_warmup_steps = n_warmup_steps
        # forward() applies sigmoid before the loss, so plain BCELoss matches.
        # (BCEWithLogitsLoss would be more numerically stable, but switching
        # would change the probabilities forward() returns to callers.)
        self.criterion = nn.BCELoss()

    def forward(self, input_ids, attention_mask, labels=None):
        """Return ``(loss, probabilities)``; loss is 0 when labels is None."""
        output = self.roberta(input_ids, attention_mask=attention_mask)
        output = self.classifier(output.pooler_output)
        output = torch.sigmoid(output)
        loss = 0
        if labels is not None:
            loss = self.criterion(output, labels)
        return loss, output

    def _shared_step(self, batch, stage):
        """Common train/val/test logic: reshape labels, forward, log loss."""
        input_ids = batch["input_ids"]
        attention_mask = batch["attention_mask"]
        # Reshape flat labels to (batch, n_classes). The original hard-coded
        # 80 here; `out_features` equals n_classes, so this generalizes while
        # behaving identically for the original 80-label setup.
        labels = batch["labels"].reshape(-1, self.classifier.out_features)
        loss, outputs = self(input_ids, attention_mask, labels)
        self.log(f"{stage}_loss", loss, prog_bar=True, logger=True)
        return loss, outputs, labels

    def training_step(self, batch, batch_idx):
        loss, outputs, labels = self._shared_step(batch, "train")
        # Predictions/labels are carried forward so training_epoch_end can
        # compute per-class AUROC over the whole epoch.
        return {"loss": loss, "predictions": outputs, "labels": labels}

    def validation_step(self, batch, batch_idx):
        loss, _, _ = self._shared_step(batch, "val")
        return loss

    def test_step(self, batch, batch_idx):
        loss, _, _ = self._shared_step(batch, "test")
        return loss

    def training_epoch_end(self, outputs):
        """Log per-class training AUROC to TensorBoard at each epoch end."""
        labels = torch.stack(
            [lbl for out in outputs for lbl in out["labels"].detach().cpu()]
        ).int()
        predictions = torch.stack(
            [pred for out in outputs for pred in out["predictions"].detach().cpu()]
        )
        # NOTE(review): LABEL_COLUMNS_ALL is not defined in this file —
        # presumably imported from the project's consts module; confirm it is
        # in scope and has length n_classes.
        for i, name in enumerate(LABEL_COLUMNS_ALL):
            class_roc_auc = auroc(predictions[:, i], labels[:, i])
            self.logger.experiment.add_scalar(f"{name}_roc_auc/Train",
                                              class_roc_auc,
                                              self.current_epoch)

    def configure_optimizers(self):
        """AdamW with a linear warmup/decay schedule, stepped per batch."""
        optimizer = AdamW(self.parameters(), lr=2e-5)
        scheduler = get_linear_schedule_with_warmup(
            optimizer,
            num_warmup_steps=self.n_warmup_steps,
            num_training_steps=self.n_training_steps,
        )
        return dict(
            optimizer=optimizer,
            lr_scheduler=dict(
                scheduler=scheduler,
                interval='step',  # step every batch, matching warmup-step units
            ),
        )
没有合适的资源?快使用搜索试试~ 我知道了~
温馨提示
classification report:
              precision  recall  f1-score  support
micro-avg         0.88     0.85     0.87   300000
macro-avg         0.70     0.59     0.61   300000
weighted-avg      0.87     0.85     0.86   300000
samples-avg       0.88     0.85     0.87   300000
资源推荐
资源详情
资源评论
收起资源包目录
基于Bert 预训练模型的美团评价多标签多分类任务.zip (7个子文件)
multi-label-sentiment-classifications-main
model_test.py 2KB
consts.py 2KB
model_save.py 540B
model
sentiment_tagger.py 3KB
sentiment_dataset.py 2KB
model_eval.py 2KB
model_train.py 3KB
共 7 条
- 1
资源评论
博士僧小星
- 粉丝: 1746
- 资源: 5850
下载权益
C知道特权
VIP文章
课程特权
开通VIP
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功