List of suggestions for sources of hosts data #2573

Open
DiogoMiguelCunha opened this issue Feb 14, 2024 · 7 comments

@DiogoMiguelCunha

Hello. I've browsed GitHub for years, but never created an account, so that's why it's new.
I hope this is the right place for such a suggestion.

I've been using your files for some time on my OS, and recently in uBlock. I recently searched for more lists that don't seem to have too many duplicates, and although I don't know how you would incorporate them into yours, I want to contribute by leaving all the references and resources I found.
Since you mainly deal with hosts files, I tried to find lists in that format (entries prefixed with "0.0.0.0"), but the other formats might be useful too.

01. Scam Blocklist - "A blocklist to protect users against untrustworthy sites."

02. Orthrus BlockList - "List to block ads, trackers & malwares. Plus 200.000 unique domains and about 4 MB in size."

03a. NoTrack Malware Blocklist

03b. NoTrack Tracker Blocklist

04. Adblock-Nocoin-List - "Block lists to prevent JavaScript miners."

05. Fuck Fuckadblock: Mining - "Filters for blocking browser-based miners" (for browsers, but the sites themselves might be useful)

06. Hexxium Creations Threat List - "(...) scam, phishing, deceptive content, exploit, and tech support scam sites."

07. Stalkerware-Indicators - "Indicators of stalkerware apps"

08. RPiList Specials - "Protection against fake shops, advertising, tracking and other attacks from the Internet" (translated from German)


welcome bot commented Feb 14, 2024

Hello! Thank you for opening your first issue in this repo. It’s people like you who make these host files better!

@StevenBlack
Owner

Thank you for this, Diogo @DiogoMiguelCunha.

This is good research! I'm always looking for good lists to add.

By the way, "...don't seem to have too many duplicates" is not necessarily a good thing. Here's why: this list amalgamates the work of several long-time list curators. Some of our sources curate their lists daily, with impressive diligence.

When a list arises with few duplicates, I always wonder how this potential curator knows what's been missed by several other diligent curators combined over a long period of time. A big list with little overlap with ours always makes me wonder 🤔

@StevenBlack
Owner

StevenBlack commented Feb 15, 2024

I'm going to use the panels below ⬇️ to assess each of the candidates you've found. I am using ghosts for this.

I expect this is gonna take a while because I have limited availability this week.

@StevenBlack
Owner

StevenBlack commented Feb 15, 2024

Assessing https://raw.githubusercontent.com/durablenapkin/scamblocklist/master/hosts.txt

  • 5,003 unique domains
  • 62 unique TLDs
  • Heavy on .info, which is probably a good thing.
  • 0.85% duplicate rate, which is suspiciously low.
  • List is NOT ALPHABETIZED, which makes it very hard for the list owners to curate.
$ ghosts --tld -c https://raw.githubusercontent.com/durablenapkin/scamblocklist/master/hosts.txt

----------------------------------------
Base hosts file summary:
----------------------------------------
Location: https://raw.githubusercontent.com/StevenBlack/hosts/master/hosts
Domains: 152,013
Bytes: 4.6 MB
TLD tally:  (352 unique TLD)
   com: 67,516
   pl: 9,793
   site: 8,483
   net: 7,558
   top: 7,175
   xyz: 5,503
   click: 2,931
   info: 2,526
   online: 2,416
   live: 2,299
   app: 2,212
   org: 2,198
   cfd: 1,942
   sbs: 1,785
   br: 1,514
   shop: 1,509
   life: 1,486
   space: 1,242
   fr: 1,232
   eu: 1,059
   vn: 972
   io: 939
   uk: 811
   co: 793
   ru: 761
   pro: 732
   jp: 681
   pw: 592
   me: 573
   de: 531
   dev: 474
   store: 472
   cc: 422
   club: 410
   fun: 363
   link: 324
   cloud: 323
   website: 316
   cyou: 313
   us: 299
   nl: 288
   homes: 281
   quest: 276
   cn: 270
   it: 266
   tv: 242
   icu: 232
   biz: 227
   at: 164
   monster: 158
   ml: 146
   es: 137
   buzz: 133
   tech: 133
   in: 127
   lat: 124
   se: 121
   autos: 120
   cz: 116
   mobi: 109
   world: 102
   ca: 101
   one: 99
   sh: 97
   name: 96
   au: 94
   boats: 94
   beauty: 93
   skin: 92
   cam: 91
   ws: 88
   be: 88
   pics: 86
   asia: 86
   vip: 85
   page: 83
   ink: 82
   lk: 78
   hair: 78
   hu: 75
   pt: 70
   gd: 65
   ro: 63
   digital: 59
   bio: 59
   cl: 58
   dk: 54
   lol: 53
   id: 53
   ai: 52
   network: 51
   bond: 49
   wiki: 48
   ua: 48
   zone: 48
   bid: 48
   best: 48
   tw: 46
   su: 45
   goog: 44
   to: 41
   ir: 40
   tr: 40
   za: 39
   ch: 37
   fi: 36
   host: 35
   makeup: 34
   mx: 34
   rest: 34
   company: 34
   kr: 34
   uno: 34
   trade: 32
   yachts: 32
   ar: 31
   foundation: 29
   today: 29
   care: 28
   nz: 28
   guru: 27
   tk: 26
   work: 26
   gr: 25
   mom: 24
   software: 22
   st: 22
   im: 22
   art: 22
   la: 22
   help: 22
   ng: 21
   cm: 20
   bg: 20
   ph: 20
   sk: 19
   blog: 17
   no: 16
   pk: 16
   cool: 16
   win: 15
   ga: 15
   pet: 15
   ag: 14
   il: 14
   lt: 14
   pm: 14
   lu: 14
   services: 13
   pe: 13
   stream: 13
   gay: 12
   sv: 12
   my: 12
   by: 12
   kz: 11
   gg: 11
   ovh: 11
   media: 11
   ug: 11
   ltd: 10
   sg: 10
   cx: 10
   gt: 10
   charity: 10
   bet: 10
   news: 9
   si: 9
   am: 9
   lv: 9
   bar: 9
   run: 8
   th: 8
   ie: 8
   support: 8
   agency: 8
   ae: 8
   ee: 8
   cf: 8
   solutions: 7
   ba: 7
   tn: 7
   plus: 7
   nu: 7
   group: 7
   casa: 7
   ke: 7
   rs: 7
   gives: 6
   tube: 6
   capital: 6
   do: 6
   gift: 6
   press: 6
   academy: 6
   edu: 6
   fyi: 6
   np: 6
   ly: 6
   hk: 6
   gov: 6
   hn: 5
   bd: 5
   exchange: 5
   gq: 5
   ms: 5
   delivery: 5
   so: 5
   re: 5
   li: 5
   fund: 5
   video: 4
   systems: 4
   marketing: 4
   ad: 4
   technology: 4
   energy: 4
   al: 4
   kim: 4
   gf: 4
   wtf: 4
   games: 4
   zw: 4
   mn: 4
   py: 4
   chat: 4
   nf: 4
   tattoo: 4
   promo: 4
   sale: 4
   trading: 4
   africa: 3
   team: 3
   cat: 3
   bz: 3
   social: 3
   eus: 3
   mg: 3
   global: 3
   gy: 3
   gs: 3
   cash: 3
   fm: 3
   pl-com: 3
   date: 3
   vision: 3
   mk: 3
   mt: 3
   ac: 3
   pub: 3
   email: 3
   studio: 2
   rw: 2
   dog: 2
   ge: 2
   codes: 2
   international: 2
   moscow: 2
   property: 2
   coffee: 2
   rocks: 2
   wang: 2
   sl: 2
   supply: 2
   land: 2
   ceo: 2
   mv: 2
   ao: 2
   review: 2
   vc: 2
   kg: 2
   money: 2
   mr: 2
   motorcycles: 2
   sx: 2
   love: 2
   city: 2
   credit: 2
   pictures: 2
   accountant: 2
   cafe: 2
   center: 2
   fo: 2
   lc: 2
   photography: 2
   aws: 2
   business: 2
   dating: 2
   download: 2
   expert: 2
   domains: 2
   fans: 2
   engineer: 2
   md: 2
   tips: 2
   ps: 2
   hr: 2
   sc: 2
   exposed: 2
   uy: 2
   auction: 2
   vg: 1
   mu: 1
   jobs: 1
   frl: 1
   localdomain: 1
   gdn: 1
   as: 1
   ma: 1
   is: 1
   pa: 1
   uz: 1
   report: 1
   jo: 1
   vin: 1
   gold: 1
   gl: 1
   camp: 1
   tools: 1
   cu: 1
   watch: 1
   red: 1
   science: 1
   example0101: 1
   ong: 1
   bw: 1
   markets: 1
   dz: 1
   works: 1
   bo: 1
   school: 1
   ki: 1
   om: 1
   lan: 1
   ht: 1
   direct: 1
   community: 1
   xn--p1ai: 1
   ngo: 1
   porn: 1
   ve: 1
   ne: 1
   sa: 1
   tc: 1
   financial: 1
   bnpparibas: 1
   eg: 1
   tokyo: 1
   glass: 1
   finance: 1
   style: 1
   photos: 1
----------------------------------------
----------------------------------------
Compared hosts file summary:
----------------------------------------
Location: https://raw.githubusercontent.com/durablenapkin/scamblocklist/master/hosts.txt
Domains: 5,003
Bytes: 219 kB
TLD tally:  (62 unique TLD)
   com: 2,468
   info: 2,001
   shop: 65
   ru: 36
   net: 36
   top: 33
   online: 29
   org: 24
   live: 22
   io: 21
   de: 18
   store: 18
   co: 16
   us: 15
   site: 15
   uk: 13
   one: 9
   link: 9
   it: 8
   cc: 8
   br: 8
   vip: 8
   xyz: 7
   pro: 7
   id: 7
   kr: 5
   delivery: 5
   me: 5
   biz: 5
   pl: 5
   app: 4
   click: 4
   no: 4
   club: 4
   gifts: 3
   trade: 3
   broker: 3
   tech: 3
   au: 3
   eu: 3
   space: 3
   vn: 3
   to: 3
   is: 3
   uy: 2
   pw: 2
   do: 2
   bio: 2
   website: 2
   quest: 2
   za: 2
   ca: 2
   jp: 2
   nz: 2
   pe: 2
   sg: 2
   be: 2
   lol: 1
   network: 1
   reisen: 1
   game: 1
   boats: 1
----------------------------------------
intersection: [ahundredphotos.com baltic-pipe-finansowy.blogspot.com baltic79.wordpress.com balticpipe.wordpress.com binomo.com bombardina.pl dopestore.pl faze13.com faze65.com faze67.com faze77.com faze88.com finnews7.wordpress.com jonausa.com maebl.com navi22.com navi47.com navi5.com navi61.com navi9.com navi90.com netcotto.com talentmaster.bio www.baltic-pipe-finansowy.blogspot.com www.baltic79.wordpress.com www.balticpipe.wordpress.com www.binomo.com www.bombardina.pl www.dopestore.pl www.faze67.com www.faze77.com www.faze88.com www.finnews7.wordpress.com www.jonausa.com www.maebl.com www.navi22.com www.navi5.com www.navi61.com www.navi9.com www.netcotto.com www.talentmaster.bio www.zooger.space zooger.space]
Intersection: 43 domains
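
The comparison above can be approximated without ghosts. The following is a minimal Python sketch, not how ghosts itself is implemented: the helper names `parse_hosts` and `duplicate_rate` are hypothetical, and it assumes the duplicate rate is the intersection size divided by the candidate list's domain count. That reading is consistent with the figures reported here (43 / 5,003 is roughly 0.86%).

```python
def parse_hosts(text: str) -> set[str]:
    """Return the set of hostnames found in hosts-formatted text.

    Assumes entries look like "0.0.0.0 example.com"; anything after
    a "#" is treated as a comment. Entries such as "localhost" are
    not filtered out in this sketch.
    """
    domains = set()
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments
        if not line:
            continue
        parts = line.split()
        if len(parts) < 2:  # an address with no hostname
            continue
        for host in parts[1:]:
            domains.add(host.lower())
    return domains


def duplicate_rate(base: set[str], candidate: set[str]) -> float:
    """Percentage of candidate domains that already appear in base."""
    if not candidate:
        return 0.0
    return 100.0 * len(base & candidate) / len(candidate)


# Tiny inline example; real use would read the downloaded list files.
base = parse_hosts("0.0.0.0 a.com\n0.0.0.0 b.com # note\n# comment\n")
candidate = parse_hosts("0.0.0.0 b.com\n0.0.0.0 c.com\n")
print(sorted(base & candidate))                   # ['b.com']
print(f"{duplicate_rate(base, candidate):.2f}%")  # 50.00%
```

Because the sketch only uses the standard library, it should run anywhere Python 3.9 or later is available.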

@StevenBlack
Owner

StevenBlack commented Feb 15, 2024

Assessing https://raw.githubusercontent.com/marcusminus/Orthrus-BlockList/master/hosts.txt

  • 280,142 domains
  • 605 unique TLD
  • 17% duplication rate
  • List is ALPHABETIZED BY LEAST SUBDOMAIN, which is OK, not great, for manual curation by the list's owners.
  • Heavy on the China (.cn) TLD, which is a very good thing.
  • Seems heavy on European country TLDs, especially .it.
  • This list appears too large for our mission because it will cause an avalanche of problems for Windows OS users.
----------------------------------------
Base hosts file summary:
----------------------------------------
Location: https://raw.githubusercontent.com/StevenBlack/hosts/master/hosts
Domains: 152,013
Bytes: 4.6 MB
TLD tally:  (352 unique TLD)
   com: 67,516
   pl: 9,793
   site: 8,483
   net: 7,558
   top: 7,175
   xyz: 5,503
   click: 2,931
   info: 2,526
   online: 2,416
   live: 2,299
   app: 2,212
   org: 2,198
   cfd: 1,942
   sbs: 1,785
   br: 1,514
   shop: 1,509
   life: 1,486
   space: 1,242
   fr: 1,232
   eu: 1,059
   vn: 972
   io: 939
   uk: 811
   co: 793
   ru: 761
   pro: 732
   jp: 681
   pw: 592
   me: 573
   de: 531
   dev: 474
   store: 472
   cc: 422
   club: 410
   fun: 363
   link: 324
   cloud: 323
   website: 316
   cyou: 313
   us: 299
   nl: 288
   homes: 281
   quest: 276
   cn: 270
   it: 266
   tv: 242
   icu: 232
   biz: 227
   at: 164
   monster: 158
   ml: 146
   es: 137
   tech: 133
   buzz: 133
   in: 127
   lat: 124
   se: 121
   autos: 120
   cz: 116
   mobi: 109
   world: 102
   ca: 101
   one: 99
   sh: 97
   name: 96
   boats: 94
   au: 94
   beauty: 93
   skin: 92
   cam: 91
   ws: 88
   be: 88
   asia: 86
   pics: 86
   vip: 85
   page: 83
   ink: 82
   lk: 78
   hair: 78
   hu: 75
   pt: 70
   gd: 65
   ro: 63
   digital: 59
   bio: 59
   cl: 58
   dk: 54
   id: 53
   lol: 53
   ai: 52
   network: 51
   bond: 49
   wiki: 48
   zone: 48
   bid: 48
   best: 48
   ua: 48
   tw: 46
   su: 45
   goog: 44
   to: 41
   ir: 40
   tr: 40
   za: 39
   ch: 37
   fi: 36
   host: 35
   mx: 34
   makeup: 34
   uno: 34
   kr: 34
   rest: 34
   company: 34
   trade: 32
   yachts: 32
   ar: 31
   today: 29
   foundation: 29
   nz: 28
   care: 28
   guru: 27
   work: 26
   tk: 26
   gr: 25
   mom: 24
   art: 22
   im: 22
   software: 22
   la: 22
   help: 22
   st: 22
   ng: 21
   cm: 20
   bg: 20
   ph: 20
   sk: 19
   blog: 17
   cool: 16
   pk: 16
   no: 16
   ga: 15
   pet: 15
   win: 15
   pm: 14
   il: 14
   ag: 14
   lu: 14
   lt: 14
   stream: 13
   services: 13
   pe: 13
   sv: 12
   gay: 12
   by: 12
   my: 12
   kz: 11
   ug: 11
   ovh: 11
   gg: 11
   media: 11
   ltd: 10
   gt: 10
   sg: 10
   cx: 10
   bet: 10
   charity: 10
   lv: 9
   si: 9
   bar: 9
   am: 9
   news: 9
   ie: 8
   agency: 8
   support: 8
   ee: 8
   run: 8
   cf: 8
   ae: 8
   th: 8
   ke: 7
   rs: 7
   tn: 7
   nu: 7
   ba: 7
   solutions: 7
   casa: 7
   group: 7
   plus: 7
   gov: 6
   ly: 6
   fyi: 6
   gift: 6
   hk: 6
   gives: 6
   press: 6
   capital: 6
   do: 6
   edu: 6
   tube: 6
   np: 6
   academy: 6
   re: 5
   li: 5
   bd: 5
   hn: 5
   exchange: 5
   fund: 5
   ms: 5
   so: 5
   delivery: 5
   gq: 5
   sale: 4
   technology: 4
   trading: 4
   systems: 4
   energy: 4
   kim: 4
   video: 4
   mn: 4
   py: 4
   tattoo: 4
   ad: 4
   zw: 4
   marketing: 4
   promo: 4
   nf: 4
   al: 4
   games: 4
   gf: 4
   wtf: 4
   chat: 4
   team: 3
   gs: 3
   vision: 3
   pl-com: 3
   pub: 3
   mt: 3
   ac: 3
   gy: 3
   cat: 3
   cash: 3
   bz: 3
   mk: 3
   fm: 3
   email: 3
   mg: 3
   date: 3
   africa: 3
   social: 3
   eus: 3
   global: 3
   supply: 2
   international: 2
   accountant: 2
   ceo: 2
   studio: 2
   codes: 2
   ge: 2
   credit: 2
   land: 2
   download: 2
   ao: 2
   coffee: 2
   sl: 2
   engineer: 2
   kg: 2
   city: 2
   center: 2
   moscow: 2
   motorcycles: 2
   cafe: 2
   exposed: 2
   expert: 2
   hr: 2
   domains: 2
   sc: 2
   photography: 2
   rocks: 2
   ps: 2
   vc: 2
   dog: 2
   love: 2
   rw: 2
   auction: 2
   tips: 2
   aws: 2
   property: 2
   mr: 2
   dating: 2
   md: 2
   uy: 2
   sx: 2
   business: 2
   money: 2
   wang: 2
   mv: 2
   fans: 2
   pictures: 2
   lc: 2
   fo: 2
   review: 2
   style: 1
   bo: 1
   tools: 1
   finance: 1
   camp: 1
   ne: 1
   bw: 1
   gdn: 1
   tokyo: 1
   eg: 1
   lan: 1
   localdomain: 1
   jobs: 1
   financial: 1
   om: 1
   ma: 1
   report: 1
   ht: 1
   dz: 1
   works: 1
   science: 1
   watch: 1
   uz: 1
   vin: 1
   as: 1
   red: 1
   gold: 1
   porn: 1
   is: 1
   xn--p1ai: 1
   school: 1
   ki: 1
   mu: 1
   tc: 1
   pa: 1
   gl: 1
   vg: 1
   frl: 1
   sa: 1
   jo: 1
   photos: 1
   bnpparibas: 1
   ve: 1
   glass: 1
   ngo: 1
   direct: 1
   community: 1
   markets: 1
   cu: 1
   example0101: 1
   ong: 1
----------------------------------------
----------------------------------------
Compared hosts file summary:
----------------------------------------
Location: https://raw.githubusercontent.com/marcusminus/Orthrus-BlockList/master/hosts.txt
Domains: 280,142
Bytes: 8.5 MB
TLD tally:  (605 unique TLD)
   com: 147,388
   net: 19,851
   it: 18,343
   xyz: 12,214
   pl: 6,289
   ru: 4,236
   info: 3,948
   cn: 3,928
   top: 3,444
   site: 3,358
   io: 3,070
   org: 2,920
   de: 2,586
   fr: 2,196
   jp: 2,078
   eu: 1,541
   co: 1,470
   online: 1,341
   me: 1,323
   nl: 1,306
   uk: 1,265
   space: 1,247
   club: 1,105
   br: 1,094
   pro: 1,065
   vn: 989
   in: 952
   app: 919
   live: 919
   shop: 867
   biz: 818
   tv: 807
   au: 636
   es: 624
   life: 583
   tk: 537
   link: 529
   ca: 524
   ml: 524
   cf: 506
   us: 485
   website: 456
   fun: 446
   click: 440
   cc: 415
   se: 407
   dev: 407
   at: 404
   ga: 392
   cz: 377
   pw: 375
   store: 352
   icu: 347
   cyou: 343
   kr: 324
   digital: 324
   work: 313
   casa: 312
   tech: 299
   be: 285
   ch: 281
   monster: 277
   ua: 259
   im: 242
   mobi: 237
   one: 229
   buzz: 226
   id: 216
   cloud: 213
   uno: 211
   host: 206
   ro: 205
   mx: 198
   tr: 197
   cam: 192
   za: 190
   ir: 182
   dk: 178
   cl: 177
   pt: 171
   edu: 170
   lol: 164
   name: 161
   asia: 154
   hu: 147
   bar: 145
   ai: 142
   pics: 137
   rest: 134
   fi: 133
   ink: 131
   gq: 124
   win: 124
   nz: 123
   vip: 122
   ng: 121
   su: 119
   ar: 116
   no: 114
   delivery: 111
   tw: 110
   media: 109
   cfd: 108
   beauty: 106
   pe: 106
   surf: 105
   ltd: 104
   network: 104
   pk: 98
   to: 94
   sk: 91
   bid: 91
   bg: 90
   gr: 88
   my: 87
   world: 81
   sg: 79
   guru: 78
   il: 77
   by: 71
   sbs: 71
   mom: 67
   quest: 66
   ph: 66
   sh: 66
   ie: 66
   today: 64
   ws: 64
   la: 63
   hk: 62
   love: 57
   lt: 53
   ae: 53
   autos: 52
   lv: 52
   wang: 51
   best: 50
   ug: 50
   ke: 49
   homes: 47
   bond: 47
   art: 46
   am: 46
   th: 46
   kz: 46
   st: 46
   si: 43
   rs: 41
   ly: 41
   ovh: 41
   gg: 37
   page: 35
   ma: 34
   ee: 33
   trade: 33
   xn--p1ai: 32
   news: 31
   np: 31
   support: 30
   hr: 30
   services: 29
   fm: 29
   ge: 27
   design: 27
   stream: 27
   rocks: 27
   nu: 26
   gd: 25
   re: 25
   zone: 24
   ec: 24
   sa: 23
   gov: 23
   tn: 23
   lu: 23
   is: 22
   video: 22
   ad: 22
   loan: 21
   cool: 21
   gt: 21
   agency: 21
   plus: 21
   wiki: 21
   email: 21
   pet: 21
   pub: 20
   bd: 20
   systems: 20
   bet: 20
   az: 19
   uy: 19
   ms: 18
   ac: 18
   fyi: 18
   help: 18
   mk: 18
   bz: 18
   skin: 18
   cat: 17
   zw: 17
   group: 17
   so: 17
   do: 16
   press: 16
   global: 16
   eg: 16
   cm: 16
   blog: 15
   gift: 15
   ag: 15
   cx: 15
   care: 15
   vin: 15
   run: 15
   tz: 14
   cash: 14
   li: 14
   tokyo: 14
   hair: 14
   lk: 14
   xxx: 14
   download: 13
   qa: 13
   date: 13
   tl: 13
   sv: 13
   mz: 13
   ba: 12
   wtf: 12
   hn: 12
   md: 12
   cr: 12
   xin: 11
   al: 11
   mn: 11
   solutions: 11
   company: 11
   vu: 11
   foundation: 11
   goog: 11
   mg: 10
   codes: 10
   money: 10
   social: 10
   tools: 10
   uz: 10
   center: 10
   works: 10
   software: 9
   ao: 9
   tc: 9
   ht: 9
   aero: 9
   sncf: 9
   py: 9
   ninja: 9
   party: 9
   bo: 8
   studio: 8
   pm: 8
   mt: 8
   moe: 8
   technology: 8
   bi: 8
   ps: 8
   exchange: 8
   ren: 8
   gives: 7
   jo: 7
   vc: 7
   kg: 7
   pa: 7
   fit: 7
   review: 7
   team: 7
   rw: 7
   mm: 7
   promo: 7
   capital: 7
   xn--fiqs8s: 7
   af: 7
   red: 7
   gs: 6
   games: 6
   gl: 6
   academy: 6
   sale: 6
   watch: 6
   marketing: 6
   gdn: 6
   game: 6
   education: 6
   ci: 6
   tube: 6
   house: 6
   et: 6
   ve: 6
   makeup: 5
   events: 5
   boats: 5
   sb: 5
   style: 5
   as: 5
   business: 5
   auction: 5
   nf: 5
   kw: 5
   gy: 5
   porn: 5
   vg: 5
   ni: 5
   gay: 5
   bnpparibas: 5
   africa: 5
   sx: 5
   city: 5
   wf: 5
   coop: 4
   cv: 4
   credit: 4
   cd: 4
   gh: 4
   express: 4
   srl: 4
   001com: 4
   school: 4
   onion: 4
   vision: 4
   na: 4
   je: 4
   gifts: 4
   london: 4
   fashion: 4
   mu: 4
   health: 4
   bj: 4
   yt: 4
   finance: 4
   mw: 4
   webcam: 4
   racing: 4
   blue: 4
   consulting: 3
   ax: 3
   ooo: 3
   ne: 3
   vi: 3
   men: 3
   lighting: 3
   tips: 3
   contact: 3
   desi: 3
   iq: 3
   gold: 3
   sex: 3
   reviews: 3
   supply: 3
   tel: 3
   sc: 3
   tj: 3
   international: 3
   ky: 3
   clinic: 3
   ski: 3
   lb: 3
   cy: 3
   tt: 3
   dz: 3
   direct: 3
   gratis: 3
   dog: 3
   sn: 3
   leclerc: 3
   fans: 3
   chat: 3
   om: 3
   science: 3
   tf: 3
   market: 3
   pink: 3
   clothing: 3
   management: 3
   berlin: 3
   sr: 3
   mv: 3
   guide: 3
   photography: 3
   nyc: 3
   eus: 3
   earth: 3
   tg: 3
   abbott: 3
   fund: 3
   land: 3
   lat: 3
   inc: 2
   bike: 2
   photos: 2
   security: 2
   swiss: 2
   movie: 2
   pr: 2
   casino: 2
   baby: 2
   engineering: 2
   bauhaus: 2
   tm: 2
   exposed: 2
   zm: 2
   domains: 2
   build: 2
   boutique: 2
   kim: 2
   onl: 2
   army: 2
   bw: 2
   camera: 2
   kitchen: 2
   jobs: 2
   tax: 2
   saxo: 2
   immo: 2
   community: 2
   sd: 2
   rodeo: 2
   careers: 2
   moscow: 2
   vtt: 2
   photo: 2
   partners: 2
   ls: 2
   hamburg: 2
   int: 2
   komatsu: 2
   bs: 2
   ceo: 2
   coffee: 2
   holdings: 2
   ki: 2
   cards: 2
   sexy: 2
   travel: 2
   markets: 2
   report: 2
   ngo: 2
   cu: 2
   llc: 2
   lc: 2
   computer: 2
   energy: 2
   mba: 2
   accountant: 2
   dm: 2
   gallery: 2
   miami: 2
   kh: 2
   adult: 2
   archi: 2
   properties: 2
   church: 1
   basketball: 1
   canon: 1
   vast: 1
   dance: 1
   76: 1
   xn--p1acf: 1
   xy: 1
   pizza: 1
   rio: 1
   bot: 1
   jm: 1
   meet: 1
   xn--io0a7i: 1
   haus: 1
   weir: 1
   xn--90ais: 1
   js: 1
   fat1domain1: 1
   cologne: 1
   legal: 1
   navy: 1
   black: 1
   law: 1
   bh: 1
   xn--ngbrx: 1
   tld: 1
   sz: 1
   luxe: 1
   local: 1
   mr: 1
   kaufen: 1
   motorcycles: 1
   jetzt: 1
   deals: 1
   dental: 1
   gm: 1
   expert: 1
   beer: 1
   n3w1d0ma1n: 1
   rip: 1
   tl2: 1
   shoes: 1
   solar: 1
   pictet: 1
   loans: 1
   fishing: 1
   tienda: 1
   pf: 1
   contractors: 1
   bt: 1
   camp: 1
   capetown: 1
   builders: 1
   frl: 1
   xiaomi: 1
   bn: 1
   property: 1
   show: 1
   mq: 1
   xn--node: 1
   nrw: 1
   fj: 1
   coach: 1
   cg: 1
   politie: 1
   voto: 1
   gp: 1
   okinawa: 1
   sl: 1
   sky: 1
   place: 1
   toys: 1
   arab: 1
   td: 1
   ong: 1
   bm: 1
   tattoo: 1
   actor: 1
   mortgage: 1
   fitness: 1
   farm: 1
   glass: 1
   charity: 1
   gf: 1
   comm: 1
   ventures: 1
   aws: 1
   engineer: 1
   auto: 1
   directory: 1
   nr: 1
   college: 1
   koeln: 1
   sandvik: 1
   kpmg: 1
   bf: 1
   xn--mxtq1m: 1
   lgbt: 1
   gi: 1
   menu: 1
   cafe: 1
   family: 1
   cab: 1
   lib: 1
   car: 1
   ist: 1
   museum: 1
   financial: 1
   observer: 1
   radio: 1
   mp: 1
   sm: 1
   cw: 1
   kyoto: 1
   jcb: 1
   pictures: 1
   wedding: 1
   lan: 1
   ck: 1
   yachts: 1
   holiday: 1
   paris: 1
   pn: 1
   film: 1
   apple: 1
   poker: 1
   fan: 1
   football: 1
   equipment: 1
   scot: 1
   tours: 1
   moi: 1
   institute: 1
   cloudfront: 1
   pg: 1
   dj: 1
   bayern: 1
   healthcare: 1
   localdomain: 1
   bit: 1
   trading: 1
   investments: 1
   brussels: 1
   amsterdam: 1
----------------------------------------
Intersection: 47,568 domains
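
A TLD tally like the ones shown in these summaries can be sketched in a few lines of Python. This is an illustration only: the function name `tld_tally` is hypothetical, and skipping names without a dot (such as a bare `localhost`) is an assumption, not confirmed ghosts behaviour.

```python
from collections import Counter


def tld_tally(domains):
    """Count domains per top-level label (the text after the last dot)."""
    tally = Counter()
    for domain in domains:
        domain = domain.strip().rstrip(".").lower()
        if "." not in domain:
            continue  # assumption: ignore dotless names like "localhost"
        tally[domain.rsplit(".", 1)[1]] += 1
    return tally


# Tiny inline example; real use would pass the domains parsed from a list.
tally = tld_tally(["a.com", "www.b.com", "c.info", "localhost"])
print(f"TLD tally:  ({len(tally)} unique TLD)")
for tld, count in tally.most_common():  # largest counts first
    print(f"   {tld}: {count:,}")
```

Sorting by count makes skews easy to spot, such as the heavy .info share in the first candidate or the large .it share in this one.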

@StevenBlack
Owner

(To be continued...)

@DiogoMiguelCunha
Author

Thank you for the explanations and for showing examples of how the files are compared using Ghosts.
I checked out Ghosts, but it's only available for Linux or Darwin, and I'm using Windows. I even looked into Windows' WSL to try it out, but judging by the reviews from Linux users, unless you know exactly what you're doing, WSL can be a pain to deal with and can hog resources.

Back to the lists, it makes a lot of sense that long-time curators wouldn't miss much.
Too many hosts entries on Windows do cause an avalanche of problems, as you said. I use HostMan to clean and rearrange them, but I've had to reboot in safe mode more than once because I was playing around and the list got too long and was being constantly accessed. That's why I decided to just go with yours, and add whatever else in uBlock.

I hope you'll be able to extract something useful from these lists!
