为了账号安全,请及时绑定邮箱和手机立即绑定

python如何解析指定div标签中的标签

python如何解析指定div标签中的标签

宝慕林4294392 2022-05-11 15:52:26
我已经尝试了这个网站上的其他解决方案,但仍然无法解决问题,我的问题是:<div class="ds"><div title="Today" class="dh">...<div title="Pazartesi" class="dh">26 Agu Pzt我想解析这个“ds”类中的所有 div 标题,谢谢你的帮助
查看完整描述

1 回答

?
慕妹3146593

TA贡献1820条经验 获得超9个赞

使用BeautifulSoup,lxml或类似模块代替regex.


美丽汤:


from bs4 import BeautifulSoup


text = '<div class="ds"><div title="Today" class="dh">...<div title="Pazartesi" class="dh">26 Agu Pzt'


soup = BeautifulSoup(text, 'html.parser')


for item in soup.select('.ds div[title]'):

    print(item['title'])


# or as list comprehensions    


titles = [item['title'] for item in soup.select('.ds div[title]')]

print(titles)

lxml:


import lxml.html


text = '<div class="ds"><div title="Today" class="dh">...<div title="Pazartesi" class="dh">26 Agu Pzt'


soup = lxml.html.fromstring(text)


for item in soup.cssselect('.ds div[title]'):

    print(item.attrib['title'])


# or as list comprehensions    


titles = [item.attrib['title'] for item in soup.cssselect('.ds div[title]')]

print(titles)

查询:


import pyquery


text = '<div class="ds"><div title="Today" class="dh">...<div title="Pazartesi" class="dh">26 Agu Pzt'


soup = pyquery.PyQuery(text)


for item in soup('.ds div[title]'):

    print(item.attrib['title'])


# or as list comprehensions    


titles = [item.attrib['title'] for item in soup('.ds div[title]')]

print(titles)

parsel : (由Scrapy 的 Selectors使用)


import parsel


sel = parsel.Selector(text)


for item in sel.css('.ds div[title]'):

    print(item.attrib['title'])


titles = [item.attrib['title'] for item in sel.css('.ds div[title]')]

print(titles)


查看完整回答
反对 回复 2022-05-11
  • 1 回答
  • 0 关注
  • 462 浏览
慕课专栏
更多

添加回答

举报

0/150
提交
取消
微信客服

购课补贴
联系客服咨询优惠详情

帮助反馈 APP下载

慕课网APP
您的移动学习伙伴

公众号

扫描二维码
关注慕课网微信公众号