Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

☘️【新需求】文档全文逐段审核 #506

Open
wisdomqin opened this issue Apr 21, 2024 · 0 comments
Open

☘️【新需求】文档全文逐段审核 #506

wisdomqin opened this issue Apr 21, 2024 · 0 comments

Comments

@wisdomqin
Copy link

需求背景

比如一篇文章逐段进行审核(合规性、错别字、语义不通顺等)

设计思考

  • 这个是否随着上下文长度的增加,直接通过一次请求解决呢?简单的错别字情况可能可以。但是如果需要引入专业知识的情况不行,比如需要根据一些行业规范审核是否合规的情况。
  • 另外就是拆分成小段逐个审核效果应该会更好。

实现思路

image

如图所示,可以支持 document类 触点与 Preset question类触点相连。

通常来讲,应当是 textsplitters 中的组件与Preset question触点相连,这样用户可以比如上传一份合同,先通过切分符(如换行符)进行切分(如切分成20个段落),每个段落成为一个输入逐个传入到Chain/Agent(如RetrievalQAChain或LLMChain)进行分析和审核,比如传入LLMChain可以通过提示词来做错别字及语义不通顺的审核,比如传入RetrievalQAChain来做合规性审核(知识库中存储的是公司及行业的规范要求文档内容)

@wisdomqin wisdomqin changed the title 文档全文逐段审核 【新需求】文档全文逐段审核 Apr 21, 2024
@wisdomqin wisdomqin changed the title 【新需求】文档全文逐段审核 ☘️【新需求】文档全文逐段审核 Apr 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant