Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Provide password occurences #20

Open
mdPlusPlus opened this issue Jun 17, 2017 · 14 comments
Open

Provide password occurences #20

mdPlusPlus opened this issue Jun 17, 2017 · 14 comments
Assignees

Comments

@mdPlusPlus
Copy link

mdPlusPlus commented Jun 17, 2017

Could you please provide how often the passwords occur?

This way one could build adequately weighted probable password masks for hashcat.

@berzerk0
Copy link
Owner

Yes, but for Revision 2. Set to release that by mid-July.

Also, might not be for the biggest files, since the addition of count will add a lot of file size.
However, I plan on only including targz and 7z in Rev 2, so there will be more room for different types of files.

I'll put this on the official Revision 2 Task List.

@berzerk0 berzerk0 self-assigned this Jun 17, 2017
@berzerk0
Copy link
Owner

If in the meantime, someone is looking for a jury-rigged way to approximate this, try something like

cat WORDLIST | egrep -n '.+'  | cut -d ':' -f 2- 

to get a ranking with line numbers.

Also know that the files themselves are sized based on popularity.
Smallest Files: Lines Appeared at least 75 Times,
Then at least 50, 25, 10, 5 and 2 times.

One could theoretically remove lines from say, the 75+ file, out of the 50+ file and have all the passwords that appeared 50-74 times, which could be given a different weight than the 75+.

For example:

Remove the top 196 from the top 3575 to get the 197th-3575th most popular lines

Give the lines in 196 a heavier weight than the lines in 197-3575

Not as accurate as a true appearance count, but it might provide some use.

Real deal looking like Mid/Late July, I'm nearing the end of the human work stage and more into the script work stage.

@berzerk0
Copy link
Owner

I'll include masks/rules in Rev 2.

Real deal looking like Mid/Late July, I'm nearing the end of the human work stage and more into the script work stage.

hahahaha

@cMadan
Copy link

cMadan commented Mar 13, 2018

Are the occurrence/frequency counts stored somewhere? I don't see them in the Real Passwords lists (https://github.com/berzerk0/Probable-Wordlists/tree/master/Real-Passwords), but maybe I'm looking in the wrong place?

@berzerk0
Copy link
Owner

I have not included a file where the number of appearances for a line is paired with the line itself.

I have however, in Analysis-Files, included masks and rules that are based on that information.

Did you want to create a mask or some other kind of analysis?

@cMadan
Copy link

cMadan commented Mar 13, 2018

I am interested in looking at the characteristics of passwords people use, so frequency counts are useful--rather than just rank information. I don't think the rules included in that folder can help with this analysis.

@berzerk0
Copy link
Owner

The masks include character type, number and order.
ProbWL-mask-probable-v2-counts.txt contains appearance counts for the masks it found.

Are you looking for more depth than that?

@cMadan
Copy link

cMadan commented Mar 13, 2018

I think I'm looking for something simpler than that, e.g., in Top1575-probable-v2.txt, what are the frequencies for "dragon" and "sunshine".

@berzerk0
Copy link
Owner

I suppose I could throw together those for the smaller files, sure.

@berzerk0 berzerk0 reopened this Mar 13, 2018
@cMadan
Copy link

cMadan commented Oct 22, 2018

Hi! Any plans for implementing this--or suggestions on how I could do it myself? Thanks!

@berzerk0
Copy link
Owner

Apologies, this project has taken a bit of a backseat. I’ve got the files that I can make these lists with - just gotta find the time to get them prepped.

This week is unlikely but this weekend perhaps

@berzerk0
Copy link
Owner

Uploaded for 1575, 12Thousand and 304Thousand with appearance counts

https://github.com/berzerk0/Probable-Wordlists/tree/master/Analysis-Files

@cMadan
Copy link

cMadan commented Oct 27, 2018

Great, thank you!!

@MauriRios
Copy link

can you make a dic.. with numbers 10.000.000 to 80.000.000? thats the DNI from argentain, i must peopel put that dni number in hers wifi

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants