scrapy
lxml[html_clean]
lxml_html_clean
newspaper4k
torch
transformers
itemadapter
