DropPath implementation #2119

Closed Answered by rwightman
IsmaelElsharkawi asked this question in Q&A

@IsmaelElsharkawi it's better to divide by the keep probability at train time, so the next layer sees consistent activation stats in both train and eval mode, than to muck around with rescaling at test time :)
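
To make the train-time division concrete, here is a minimal framework-free sketch. It is not the timm code: the names `drop_path`, `drop_prob`, `training` and `rng` are assumptions chosen for illustration, and plain Python lists stand in for tensors. Each sample is either zeroed or scaled by `1 / keep_prob`, so the expected activation matches what the next layer sees at eval time, where the function is an identity.

```python
import random


def drop_path(batch, drop_prob=0.0, training=False, rng=random):
    """Per-sample stochastic depth on a batch given as a list of lists of floats.

    Illustrative sketch only; names and signature are hypothetical.
    """
    # Eval mode (or no dropping): pass activations through unchanged.
    if drop_prob == 0.0 or not training:
        return [list(sample) for sample in batch]

    keep_prob = 1.0 - drop_prob
    out = []
    for sample in batch:
        # One Bernoulli draw per sample: the whole path is kept or dropped.
        keep = rng.random() < keep_prob
        # Kept samples are divided by keep_prob so the expectation is unchanged.
        scale = 1.0 / keep_prob if keep else 0.0
        out.append([v * scale for v in sample])
    return out


if __name__ == "__main__":
    rng = random.Random(0)
    batch = [[1.0] for _ in range(20000)]
    train_out = drop_path(batch, drop_prob=0.2, training=True, rng=rng)
    mean = sum(s[0] for s in train_out) / len(train_out)
    print(f"train-time mean activation: {mean:.3f}")
    print("eval output unchanged:", drop_path(batch[:2], drop_prob=0.2) == batch[:2])
```

With this scaling nothing needs to change at inference: the layer is simply switched off. The alternative (no division during training, multiply by `keep_prob` at test time) gives the same expectation but puts the correction in the inference path.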

Also note that there is a TF impl of this that accompanied the original EfficientNet code; they called it drop connect (which conflicted with the name of another paper):

https://github.com/tensorflow/tpu/blob/master/models/official/efficientnet/utils.py#L276-L291

Answer selected by IsmaelElsharkawi