simpleT5 | Generating Titles from English Abstracts

Original, by 大邓 (大邓和他的Python)

2024-09-09

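simpleT5 is a library built on PyTorch (through PyTorch Lightning and Hugging Face Transformers) that aims to make T5 models simple to train, use, and customize. T5 (Text-to-Text Transfer Transformer) is a Transformer-based natural language processing model developed by researchers at Google. It uses an encoder-decoder architecture: the encoder maps the input text to a sequence of vector representations, and the decoder generates the target text from them.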
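To make the "text-to-text" idea concrete, here is a minimal sketch of how different tasks can be cast as plain text in, plain text out. The two prefixes follow the convention of the original T5 checkpoints; the helper name `to_text_to_text` and the task keys are made up for illustration, and whether a prefix is needed at all depends on how a given checkpoint was fine-tuned.

```python
# Illustrative only: casting tasks into T5's text-to-text format.
# The task keys and the helper name are hypothetical, not part of simpleT5.
TASK_PREFIXES = {
    "summarize": "summarize: ",
    "en_to_de": "translate English to German: ",
}


def to_text_to_text(task: str, text: str) -> str:
    """Prepend the task prefix and collapse whitespace in the input text."""
    return TASK_PREFIXES[task] + " ".join(text.split())


print(to_text_to_text("summarize", "Overton is a system for\nbuilding ML systems."))
```

Because every task is just a string going in and a string coming out, the same model class and the same `predict` call can serve summarization, translation, or title generation.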

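The design goal of simpleT5 is to lower the barrier to using T5 as far as possible, so that users can apply it to NLP tasks quickly instead of writing training and inference code from scratch.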

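simpleT5 exposes a small API: given the input text and a few model parameters, it runs text-to-text tasks such as summarization, machine translation, and dialogue. Pretrained checkpoints of different sizes, such as T5-small, T5-base, and T5-large, can be loaded by name, so users can pick one that fits the task and the available hardware.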

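Beyond inference, simpleT5 also takes care of routine work during training, such as text preprocessing, dataset loading, and logging of training progress, which makes training and debugging easier. The project also ships documentation and example code to help users get started.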
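As a rough sketch of what preparing training data could look like: to my understanding of the simpleT5 README, training expects a pandas DataFrame with two columns, `source_text` and `target_text`. The helper names `build_rows` and `fine_tune` below are hypothetical, and the `from_pretrained` and `train` arguments reflect my reading of the README, so check them against the version you have installed. `fine_tune` is only defined here, not called, because it needs `simplet5`, `pandas`, and a model download.

```python
from typing import Dict, List, Tuple


def build_rows(pairs: List[Tuple[str, str]]) -> List[Dict[str, str]]:
    """Turn (abstract, title) pairs into rows with the two expected column names."""
    rows = []
    for abstract, title in pairs:
        source = " ".join(abstract.split())  # collapse line breaks and extra spaces
        target = " ".join(title.split())
        if source and target:  # skip pairs with an empty side
            rows.append({"source_text": source, "target_text": target})
    return rows


def fine_tune(train_rows: List[Dict[str, str]], eval_rows: List[Dict[str, str]]):
    """Hypothetical fine-tuning wrapper; argument names assumed from the README."""
    import pandas as pd
    from simplet5 import SimpleT5

    model = SimpleT5()
    model.from_pretrained(model_type="t5", model_name="t5-small")
    model.train(
        train_df=pd.DataFrame(train_rows),
        eval_df=pd.DataFrame(eval_rows),
        source_max_token_len=512,
        target_max_token_len=64,
        batch_size=8,
        max_epochs=3,
        use_gpu=False,
    )
    return model


rows = build_rows([("We describe a system\ncalled Overton ...", "Overton: A Data System")])
print(rows[0]["target_text"])
```

Keeping the data preparation separate from the training call makes it easy to inspect the rows before spending time on a training run.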

In short, simpleT5 is a quick, convenient, and customizable way to work with T5 models, and it can save considerable development time on NLP tasks.

Today we use snrspeaks/t5-one-line-summary, a model published on Hugging Face, to show how to generate a title from the text of an abstract.

Installation

!pip3 install --upgrade simplet5

Quick start

# pip install --upgrade simplet5
from simplet5 import SimpleT5

# Create the wrapper, then load the fine-tuned checkpoint from the Hugging Face Hub
model = SimpleT5()
model.load_model("t5", "snrspeaks/t5-one-line-summary")

Run

    Global seed set to 42
    Downloading:   100%|          | 0.00/1.36k [00:00<?, ?B/s]
    Downloading:   100%|          | 0.00/850M [00:00<?, ?B/s]
    Downloading:   100%|          | 0.00/1.84k [00:00<?, ?B/s]
    Downloading:   100%|          | 0.00/773k [00:00<?, ?B/s]
    Downloading:   100%|          | 0.00/1.32M [00:00<?, ?B/s]
    Downloading:   100%|          | 0.00/1.74k [00:00<?, ?B/s]

Generating a title from an English abstract

abstract = """We describe a system called Overton, whose main design goal is to support engineers in building, monitoring, and improving production 
machine learning systems. Key challenges engineers face are monitoring fine-grained quality, diagnosing errors in sophisticated applications, and 
handling contradictory or incomplete supervision data. Overton automates the life cycle of model construction, deployment, and monitoring by providing a 
set of novel high-level, declarative abstractions. Overton's vision is to shift developers to these higher-level tasks instead of lower-level machine learning tasks. 
In fact, using Overton, engineers can build deep-learning-based applications without writing any code in frameworks like TensorFlow. For over a year, 
Overton has been used in production to support multiple applications in both near-real-time applications and back-of-house processing. In that time, 
Overton-based applications have answered billions of queries in multiple languages and processed trillions of records reducing errors 1.7-2.9 times versus production systems.
"""

model.predict(abstract)

Run

['Overton: Building, Deploying, and Monitoring Deep Machine Learning Systems']
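To title several abstracts in one go, the single call above can be wrapped in a small loop. The sketch below is an assumption-laden convenience, not part of simpleT5: `title_abstracts` is a hypothetical helper that accepts any function mapping a string to a list of strings (such as `model.predict`), and it collapses the line breaks that a triple-quoted abstract contains before passing the text on. A stub stands in for the model so the snippet runs without downloading anything.

```python
from typing import Callable, List, Tuple


def normalize(text: str) -> str:
    """Collapse line breaks and repeated spaces into single spaces."""
    return " ".join(text.split())


def title_abstracts(
    abstracts: List[str],
    predict: Callable[[str], List[str]],
) -> List[Tuple[str, str]]:
    """Return (abstract, first predicted title) pairs, skipping empty abstracts."""
    results = []
    for raw in abstracts:
        clean = normalize(raw)
        if not clean:
            continue
        candidates = predict(clean)
        results.append((clean, candidates[0] if candidates else ""))
    return results


def stub_predict(text: str) -> List[str]:
    """Stand-in for model.predict: uses the first three words as the 'title'."""
    return [" ".join(text.split()[:3])]


pairs = title_abstracts(["We describe a system\ncalled Overton.", "   "], stub_predict)
print(pairs)
```

With the real model loaded, the call would be `title_abstracts(list_of_abstracts, model.predict)`.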

Generating multiple titles from an abstract

abstract = """We describe a system called Overton, whose main design goal is to support engineers in building, monitoring, and improving production 
machine learning systems. Key challenges engineers face are monitoring fine-grained quality, diagnosing errors in sophisticated applications, and 
handling contradictory or incomplete supervision data. Overton automates the life cycle of model construction, deployment, and monitoring by providing a 
set of novel high-level, declarative abstractions. Overton's vision is to shift developers to these higher-level tasks instead of lower-level machine learning tasks. 
In fact, using Overton, engineers can build deep-learning-based applications without writing any code in frameworks like TensorFlow. For over a year, 
Overton has been used in production to support multiple applications in both near-real-time applications and back-of-house processing. In that time, 
Overton-based applications have answered billions of queries in multiple languages and processed trillions of records reducing errors 1.7-2.9 times versus production systems.
"""

# Generate 5 titles from the abstract
n = 5
model.predict(abstract, 
              num_return_sequences=n, 
              num_beams=n+1)

Run

['Overton: Building, Deploying, and Monitoring Deep Machine Learning Systems',
 'Overton: Building, Deployment, and Improving Production Machine Learning Systems',
 'Overton: Building, Deploying, and Monitoring Machine Learning Systems for Engineers',
 'Overton: Building, Deploying, and Monitoring Machine Learning Systems',
 'Overton: Building, Deployment, and Monitoring Deep Machine Learning Systems']
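Two details of the call above are worth spelling out. In Hugging Face beam search, `num_return_sequences` may not exceed `num_beams`, which is why the example asks for `n+1` beams, and the returned candidates are often near-duplicates of each other. The sketch below shows one way to handle both; `beam_kwargs` and `unique_titles` are hypothetical helpers, not part of simpleT5.

```python
from typing import Dict, List


def beam_kwargs(n_titles: int, extra_beams: int = 1) -> Dict[str, int]:
    """Build keyword arguments that keep num_beams >= num_return_sequences."""
    if n_titles < 1:
        raise ValueError("n_titles must be at least 1")
    return {
        "num_return_sequences": n_titles,
        "num_beams": n_titles + max(0, extra_beams),
    }


def unique_titles(candidates: List[str]) -> List[str]:
    """Drop titles that differ only in case or spacing, keeping the original order."""
    seen = set()
    kept = []
    for title in candidates:
        key = " ".join(title.lower().split())
        if key and key not in seen:
            seen.add(key)
            kept.append(title.strip())
    return kept


print(beam_kwargs(5))
print(unique_titles(["Overton: A System", "overton:  a system", "Overton in Production"]))
```

With the real model, the call would be `unique_titles(model.predict(abstract, **beam_kwargs(5)))`.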
