-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
unraid pcie issues #26
Comments
I'm experiencing a similar issue where the second TPU, |
Thank you I will check it out |
I am having a heck of a time getting this working I added pci=noaer pcie_aspm=off to my unraid OS section. It seemed like it was working better but after about an hour or so the whole server just stops responding So now it works for a bit but then my whole server stops responding. I cannot SSH webgui nothing. I have to hard reboot it by holding the power button. I also don't think I can see logs as I have to reboot so I don't get the syslog. I thought I was on the right path but I guess not. |
Thanks for feedback and diagnostics info. I'm really interested to investigate cause of this issues to see if there's manufacturing flaw of particular incompatibility issue. Could you please contact me using form at the bottom of page here with your order number? |
@magic-blue-smoke ok I have reached out via the contact form. Thank you |
likewise. |
I was able to get both working again by removing the associated PCI devices and rescanning, however, this fix does not survive a reboot.
|
I figured out the problem I was having wasn't really a problem after all, turns out the host machine can't use a PCI device that has been passed into a docker container. Once I realized that I determined everything is working properly. |
What particular machine do you have? I was hoping to use this adapter with a Synology and pass the PCI coral to my docker container. |
@nmajin I'll try to explain in other words what @tehniemer mean When using VMs, they don't have direct access to hardware of your PC. Instead, VM environment emulates network card, drives, video adapter and other hardware. Coral TPU can't be emulated and needs PCIe pass through - a mechanism to "pull out" particular PCIe device from host PC and provide exclusive access to it within VM. Now if I get it right, adapter made both Coral TPUs available for use. However, one TPU was configured with PCIe passthrough to VM, another was not and remained available in host system. This is expected behavior and means that TPUs can be used in a number of combinations:
|
@magic-blue-smoke thanks for the detail and providing more context. So, to clarify both TPUs being available as passthrough (to a docker container), is that possible with this adapter and the dual edge TPU Coral? Sorry, just want to clarify I can in fact use both TPUs if and when I get the Coral and the adapter. |
In my configuration I have both TPUs passed through to a docker container. |
Just to add an additional anecdoate: I run Frigate with this dual-tpu-adapter in my Unraid Server, both TPUs are passed in and have not had any such issues, been running for about 3 months now. I was sure to disable all C-States for my CPU in the BIOS which is something I've always had to do do ensure stability with Unraid. |
@magic-blue-smoke you stated you were making another revision of this adapter? I am tempted to buy it and try again though. I feel like I will have the same issues as I did before. |
Hello,
I have been trying to get your PCIe adaptor to work for a few months now with no luck. I am using unraid with Frigate v0.10 Docker container. I can see both TPUs as apex_0 and apex_1. Symptom is Frigate will un for a bot then I get a PCIe error in my syslog for unraid. IT will then shutdown one of the TPUs and the Temp goes negative. I have posted my issues in the Frigate github and the unraid forums with no luck. I have reposted my unraid post below. Please let me know what else I can troubleshoot. Love all the work you have done for the community hoping to get this to work properly.
I am having a similar issue to @AdvancedMobileRepairs Using the Dual TPU in Magic-Blue-smoke PCIe adapter. Prior to this I was using a single TPU with a different adapter that was working fine. I have been monitoring the Coral Temperatures at they have not been going above 48 Degrees. I have this error in my syslog:
If anyone has any insight into this? I already asked in the Frigate github and we troubleshooted to a point but then they told me to ask in the unraid forum.
Thank you
EDIT EDIT:
Per this thread:
https://forums.unraid.net/topic/103901-solved-aer-pcie-bus-errors/
I disabled ASPM on PCIe in my BIOS. restarted server and running frigate to see how long it works before the coral shuts down.
And it failed again! That did not fix the issue. very weird
Temp is not the issue it seems
Any insight?
The text was updated successfully, but these errors were encountered: