【python爬虫】抓取炒股概念

最新推荐文章于 2025-05-29 09:00:52 发布

原创

最新推荐文章于 2025-05-29 09:00:52 发布 · 4.9k 阅读

6 ·

CC 4.0 BY-SA版权

文章标签：

#python #自然语言处理

本文介绍了如何使用Python进行网页爬虫，特别地，分享了作者抓取炒股概念的实践经验。参考了https://www.cnblogs.com/xin-xin/p/4297852.html的详细教程，并提供了自己实现的爬虫代码，通过火狐浏览器的开发者工具分析网络请求，解析目标地址来获取数据。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

非常感谢https://www.cnblogs.com/xin-xin/p/4297852.html。该系列讲解很详细。

另附上我写的抓取炒股概念代码。

采用火狐浏览器，F12，选取Network，解析一下传送的地址。

import urllib.request
import re
import requests

# def main():
#     # url = "http://www.iwencai.com/school/dictionary?qs=study_dictonary_stock"
#     # url='http://www.iwencai.com/yike/article-class-list?tagId=37'
#     url="http://www.iwencai.com/yike/detail/auid/716981f756614a79"
#     try:
#         data = urllib.request.urlopen(url).read()
#         content = data.decode('UTF-8')
#
#         # pattern = re.compile('<div class="term_top clearfix">.*?<a.*?point_info="title">(.*?)</a></div>.*?'
#