Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

对图像进行坐标检测,生成的bbox是resize成正方形之后的值吗? #199

Open
zhaop-l opened this issue Jan 23, 2024 · 5 comments

Comments

@zhaop-l
Copy link

zhaop-l commented Jan 23, 2024

对图像进行坐标检测,生成的bbox是resize成正方形之后的值吗?

因为实例代码是将图像resize成正方形了,我跑出来结果有点偏差,所以想问下你们是正方形吗?

@fansticOne
Copy link

请问如何设计prompt来对图像进行坐标检测,输出bbox?感谢

@zhaop-l
Copy link
Author

zhaop-l commented Jan 31, 2024

请问如何设计prompt来对图像进行坐标检测,输出bbox?谢谢

Please provide the bounding box coordinate of the region this sentence describes : people on car.

@fansticOne
Copy link

请问如何设计prompt来对图像进行坐标检测,输出bbox?谢谢

Please provide the bounding box coordinate of the region this sentence describes : people on car.
试了没有给出bbox呀,回答的是:
The bounding box coordinate of the region this sentence describes is the sheep lying on the ground.

@LukeForeverYoung
Copy link
Collaborator

Yes, the images are resized to squares for example 448x448. But, the generated coordinates should be a value in the range [0,1], which are ratios that are unrelated to the actual resolution. Therefore, whatever you resize the image, the coordinates are always applicable. You may need to check if the image has been cropped, as this could cause the coordinates to be offset relative to the original image.

@fansticOne
Copy link

请问一下,对图像进行坐标检测,使用的是哪个权重文件?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants