Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] 最近一次年报 被识别成了 次年 #131

Open
griffinqiu opened this issue Apr 24, 2023 · 1 comment
Open

[BUG] 最近一次年报 被识别成了 次年 #131

griffinqiu opened this issue Apr 24, 2023 · 1 comment
Labels
bug Something isn't working

Comments

@griffinqiu
Copy link

语言识别有点点问题,问最近一次年报别识别成了“次年”,准确的拆词应该是"最近一次", "年报"。

如:
“特斯拉最近一次年报的股东权益是多少?”

  1. 版本(Version):
  • python 版本: 3.10.5
  • jionlp 版本: 1.4.35
  1. jionlp的调用代码与输入文本(Code & Text):
import time
import json
import jionlp as jio

res = jio.ner.extract_time('特斯拉最近一次年报的股东权益是多少?', time_base=time.time(), with_parsing=False)
print(json.dumps(res, ensure_ascii=False, indent=4, separators=(',', ':')))
for r in res:
    res = jio.parse_time(r['text'], time_base=time.time())
    print(res)
  1. 调用报错日志如下(Log):
[
    {
        "text":"次年",
        "offset":[
            6,
            8
        ],
        "type":"time_span"
    }
]
{'type': 'time_span', 'definition': 'accurate', 'time': ['2024-01-01 00:00:00', '2024-12-31 23:59:59']}

期望行为(Expectation)

应该提取不出时间(以为JioNLP 没有支持财报的数据)

请顺手 star 一下右上角的⭐小星星

@griffinqiu griffinqiu added the bug Something isn't working label Apr 24, 2023
@dongrixinyu
Copy link
Owner

需要结合分词工具,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants