We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
使用GeneralNewsExtractor的extract方法进行内容抽取的时候如果加了对body内容的xpath配置就报错
如何复现
屏幕截图
使用环境:
The text was updated successfully, but these errors were encountered:
你可以显看看,你获取到的html_content里面,有没有rich_media_content这个class
Sorry, something went wrong.
返回的是有那个class的,只是这个方法里面有个selector参数,在这个地方源码没有传,导致进去用下标获取时会报错
这个 selector 参数就是我传进去的 element。
kingname
No branches or pull requests
使用GeneralNewsExtractor的extract方法进行内容抽取的时候如果加了对body内容的xpath配置就报错
如何复现
body = selector.xpath(body_xpath)[0]
IndexError: list index out of range
屏幕截图
使用环境:
The text was updated successfully, but these errors were encountered: