PI-Hole not only for blocking adverts! Don't overlook this project for your network

E-waste · July 27, 2022, 10:56pm

I used to see Pi-Hole and people would talk about blockimg ads on pandora and such and I felt that it was just used to almost steal content. I get NOBODY enjoys ads, but to advocate skipping ads on still free content REALLY bothered me and I figured pi-hole wasn’t my style.

Total opposite, wish I started using it when I read about it in 2014 or so maybe a bit later. I can’t or don’t want to imagine all the wasted network energy sending data to Google, Microsoft, Netflix, Amazon, Roku, and hundreds of other companies, wow.

Just use it.

TimHolus · July 28, 2022, 2:23am

I have been using for years :), I have two boxes in Lan. It is essential to have a good set of lists…
My piholes is hovering around 2,803,319.

https://dbl.oisd.nl	
https://raw.githubusercontent.com/StevenBlack/hosts/master/hosts	
https://raw.githubusercontent.com/StevenBlack/hosts/master/alternates/fakenews/hosts	
https://raw.githubusercontent.com/StevenBlack/hosts/master/alternates/gambling/hosts	
https://someonewhocares.org/hosts/zero/hosts
https://blocklistproject.github.io/Lists/abuse.txt	
https://blocklistproject.github.io/Lists/ads.txt	
https://blocklistproject.github.io/Lists/crypto.txt	
https://blocklistproject.github.io/Lists/drugs.txt	
https://blocklistproject.github.io/Lists/fraud.txt	
https://blocklistproject.github.io/Lists/gambling.txt	
https://blocklistproject.github.io/Lists/malware.txt	
https://blocklistproject.github.io/Lists/phishing.txt	
https://blocklistproject.github.io/Lists/ransomware.txt	
https://blocklistproject.github.io/Lists/redirect.txt	
https://blocklistproject.github.io/Lists/scam.txt	
https://blocklistproject.github.io/Lists/tiktok.txt	
https://blocklistproject.github.io/Lists/tracking.txt	
https://www.github.developerdan.com/hosts/lists/ads-and-tracking-extended.txt	
https://www.github.developerdan.com/hosts/lists/amp-hosts-extended.txt	
https://www.github.developerdan.com/hosts/lists/dating-services-extended.txt	
https://www.github.developerdan.com/hosts/lists/tracking-aggressive-extended.txt	
https://raw.githubusercontent.com/FadeMind/hosts.extras/master/add.Spam/hosts	
https://v.firebog.net/hosts/static/w3kbl.txt	
https://adaway.org/hosts.txt	
https://v.firebog.net/hosts/AdguardDNS.txt	
https://v.firebog.net/hosts/Admiral.txt	
https://raw.githubusercontent.com/anudeepND/blacklist/master/adservers.txt	
https://s3.amazonaws.com/lists.disconnect.me/simple_ad.txt	
https://v.firebog.net/hosts/Easylist.txt	
https://pgl.yoyo.org/adservers/serverlist.php?hostformat=hosts&showintro=0&mimetype=plaintext	
https://raw.githubusercontent.com/FadeMind/hosts.extras/master/UncheckyAds/hosts	
https://raw.githubusercontent.com/bigdargon/hostsVN/master/hosts	
https://v.firebog.net/hosts/Easyprivacy.txt	
https://v.firebog.net/hosts/Prigent-Ads.txt	
https://raw.githubusercontent.com/FadeMind/hosts.extras/master/add.2o7Net/hosts	
https://raw.githubusercontent.com/crazy-max/WindowsSpyBlocker/master/data/hosts/spy.txt	
https://hostfiles.frogeye.fr/firstparty-trackers-hosts.txt	
https://raw.githubusercontent.com/DandelionSprout/adfilt/master/Alternate%20versions%20Anti-Malware%20List/AntiMalwareHosts.txt	
https://osint.digitalside.it/Threat-Intel/lists/latestdomains.txt	
https://s3.amazonaws.com/lists.disconnect.me/simple_malvertising.txt	
https://v.firebog.net/hosts/Prigent-Crypto.txt	
https://raw.githubusercontent.com/FadeMind/hosts.extras/master/add.Risk/hosts	
https://bitbucket.org/ethanr/dns-blacklists/raw/8575c9f96e5b4a1308f2f12394abd86d0927a4a0/bad_lists/Mandiant_APT1_Report_Appendix_D.txt	
https://phishing.army/download/phishing_army_blocklist_extended.txt	
https://gitlab.com/quidsup/notrack-blocklists/raw/master/notrack-malware.txt	
https://raw.githubusercontent.com/Spam404/lists/master/main-blacklist.txt	
https://raw.githubusercontent.com/Te-k/stalkerware-indicators/master/generated/hosts	
https://urlhaus.abuse.ch/downloads/hostfile/	
https://zerodot1.gitlab.io/CoinBlockerLists/hosts_browser

Zedicus · July 28, 2022, 3:32am

my list

https://s3.amazonaws.com/lists.disconnect.me/simple_tracking.txt	

https://s3.amazonaws.com/lists.disconnect.me/simple_ad.txt	

https://dbl.oisd.nl/	

https://phishing.army/download/phishing_army_blocklist_extended.txt	

https://raw.githubusercontent.com/evankrob/hosts-filenetrehost/master/ad_servers.txt	

https://adaway.org/hosts.txt	

https://v.firebog.net/hosts/AdguardDNS.txt	

https://raw.githubusercontent.com/anudeepND/blacklist/master/adservers.txt	

https://v.firebog.net/hosts/Easylist.txt	

https://pgl.yoyo.org/adservers/serverlist.php?hostformat=hosts&showintro=0&mimetype=plaintext	

https://raw.githubusercontent.com/FadeMind/hosts.extras/master/UncheckyAds/hosts	

https://raw.githubusercontent.com/bigdargon/hostsVN/master/hosts	

https://raw.githubusercontent.com/deathbybandaid/piholeparser/master/Subscribable-Lists/ParsedBlacklists/Prebake-Obtrusive.txt	

http://sysctl.org/cameleon/hosts

https://raw.githubusercontent.com/jdlingyu/ad-wars/master/hosts

risk · July 28, 2022, 6:08am

This is the thing with ads on the internet, they use energy and network your local device compute and ram and storage as well as remote number crunching compute and ram and storage, mostly remote though but you probably don’t get to see that part. Some but not all of it is paid for by companies who want to sell you some product or put some idea out in front of you. Often, that money also pays for the “property”/“publisher”/“sell side” business, like Pandora or Twitter or Google Search, or others.

The better the “targeting” and ad selection algorithm, the more value the business-es like Pandora or news websites will get per Ad, and more revenue per published content.

The trouble is privacy and that ad-free alternatives are hard to setup in an easy to monetize way, and some websites and services still show you ads even after you pay for the service and there’s no ad-free option.

E-waste · July 28, 2022, 3:45pm

Set your pi-hole DNS to a null IP and run your devices through it.

Block all unwanted connections and set it up functionally again.

Add to blocklist as needed, I only have less than 120,000 adlist domains, I’ll share some tracking / mobile advert sections, 2000000 is too much.

ulzeraj · July 28, 2022, 3:59pm

I use a simple unbound configuration file instead of Pihole.

#!/bin/sh
rm -f /tmp/badsites
fetch -o /tmp/badsites https://raw.githubusercontent.com/StevenBlack/hosts/master/hosts
if [ -f /tmp/badsites ]; then
    grep '^0\.0\.0\.0' /tmp/badsites | awk '{print "local-zone: \""$2"\" redirect\nlocal-data: \""$2" A 0.0.0.0\""}' > /usr/local/etc/unbound/badsites.conf
    rm /tmp/badsites
fi

Replace fetch with curl on Linux.

E-waste · July 28, 2022, 4:32pm

I like to see all the domains and stuff in the list, go back a few days and check what still works etc.

I like being able to have adlists for specific devices because some domains operate on multiple devices but for different reasons, namely MS domains. So I can block those on desktop, and let one or two through on a different system.

Later I’ll share some of my shorter blocklists and custom blacklist entries for common mobile operating systems.

2,000,000 domains blocks is too much, I don’t see the advantage of blocking websites you won’t visit.

Some examples I have are blocking mobile advertising as many of it could contain malware or just send back to google or otherwise, just buy the program and block the adware.

Shadowbane · July 28, 2022, 6:03pm

Hi everyone, I thought since we are talking Pi-hole and privacy, I thought this video was a perfect video to post.
https://www.youtube.com/watch?v=HYNZKQZEHoU

jlittle · July 28, 2022, 10:25pm

What’s the advantages of a pi-hole over using uBlock Origin?

aLilBabyOtter · July 28, 2022, 11:17pm

It’s network-wide instead of per device.

TimHolus · July 29, 2022, 7:54am

I do not agree… It is not too much, in my opinion it is just right. It’s about quality, not just quantity. Blocking at the dns level has little to do with what websites you visit. The thing is that it is supposed to block those addresses that you are not very aware of. I have no problem reaching 20% of blocked queries on my lists.

You can think of it in a similar way to virus signature databases. The large database is not a problem… in this database 99% of the signatures will never apply in your case. But this does not mean that such a base should be limited to only the group that has a % probability of occurrence for you.

Likewise with adlist… if you use only good quality adlist the amount will be balanced with the quality in good percentages. Nobody is saying that you should go for the quantity and use 50 million records from any larger adlist. Excess with poor quality will of course lead to a false positive.

I personally have no false positives. I do not use adlist on the basis of as many domains as possible, only in terms of adlist quality and reputation.

I have been doing this for years and I am happy, but if you prefer another variant, it is good for you.

E-waste · August 1, 2022, 3:00pm

Just go set it up, takes about 10 minutes or so, great documentation at the main

pi-hole.net and click doc.

I think you’ll be surprised how much is loading and connecting.

shadragon · August 1, 2022, 5:22pm

My network interface (when the PC is idle) looks like the cardiac rhythm of a speed addict. Whatever I can do to stop the data slurp is fine by me.

E-waste · August 1, 2022, 7:39pm

A quicker thing to do would be to install and run WireShark, one of the most respected network analysis tools out there, and it’s free, and open-source. You can’t do much better than this one.

You’ll likely (very likely) want to temporarily filter Router Advertisements, so go into the filters, and only show the following ports:

53 (DNS, the phone-book of the Internet)
80 (non-encrypted HTTP traffic)
443 (encrypted HTTPS)

and any others that may be relevant to you. This way you won’t see all the “Who has 192.168.1.#” > Tell 192.168 etc.

Then, when you do get into Pi-Hole, you can see instead of IP addresses, you’ll see the website names instead, and you can block what you don’t want.

As a side-note, I would like to recommend running your own DNS server, Unbound. It’s amazing, and I believe it has built-in security, and it goes directly to the main DNS record-keepers of the Planet, no intermidiate, third-parties. I think that’s how it works.

efahl · August 1, 2022, 9:28pm

If you run unbound without any specified upstreams, then yes it defaults to the root servers and recursively resolves addresses from authoritative sites (a.root-servers.net and b. and c…). But, even if you use DNSSEC, you’ve only assured yourself that the results are valid, you have not done anything to ensure privacy (all your queries to the root servers are authenticated, but not encrypted, so your local ISP - and any intermediate nodes - can watch everything you do).

So, what to do if you want unbound to ensure privacy? You need to set up DoT (DNS over TLS) and use a different upstream than the root servers. I recommend Quad9 as the upstream, their whole model is derived around privacy and proving that they mean what they say (they moved from the US to Switzerland because the Swiss privacy laws are MORE restrictive and demanding, just to prove a point). Quad9 also has several services (via different IPs), the standard 9.9.9.9 one I show below does Pi-Hole-like filtering of known malicious sites (see their descriptions at quad9.net), but you can also get unfiltered/unsecure results (9.9.9.10) and so on.

Here’s my config for using their servers in unbound:

# Forward zones over TLS
server:
  tls-cert-bundle: /etc/ssl/cert.pem

forward-zone:
  name: "."
  forward-tls-upstream: yes
  forward-addr: 9.9.9.9@853#dns.quad9.net
  forward-addr: 149.112.112.112@853#dns.quad9.net
  forward-addr: 2620:fe::fe@853#dns.quad9.net
  forward-addr: 2620:fe::9@853#dns.quad9.net

You’ll have to figure out the certs for your particular configuration to make that all work, depends a lot upon the deployment platform (BSD, Linux, Windows, whatever).

That said, I use unbound as my upstream for Pi-Hole, so Pi-Hole does the DNS filtering, but unbound does privacy (all DNS traffic over DoT, using DNSSEC to authenticate).

TimHolus · August 1, 2022, 9:48pm

I will add my own approach to the matter…

DoH over DoT.
My piholes use cloudflared (Cloudflare Tunnel client). NS1 uses 1.1.1.1 and 1.0.0.1 while NS2 uses Quad9 as upstream.

DNSSEC… in theory good, in practice 99% of internet resources do not use it. In effect, it becomes more of a placebo effect for well-being, rather than an actual safeguard.

E-waste · August 1, 2022, 9:57pm

Why should I avoid using the root servers? I think that would be the most accurate path to go through. It’s why I set up unbound, figured it would be the safest way to get to the correct place. So would going through quad9 mask my query or ensure some layer of cryptography attached to my dns request?

I read that Quad9 openly recommends DNS caching and probably other cool things, and now I learn that they adhere to strict standards, seems like a cool organization / business.

TimHolus · August 1, 2022, 10:03pm

The DNS root level is the highest in the DNS hierarchy tree because it is the first step in resolving a domain name. The root DNS server is the DNS for the root zone. It handles requests for records in the root zone and answers other requests by providing lists of authoritative name servers for the appropriate TLD (top-level domain). These are the authoritative nameservers that serve the DNS root zone. These servers contain the global list of the top-level domains. The root zone contains the following:
Organizational hierarchy  such as .com, .net, .org, .edu.

Geographic hierarchy  such as .ca, .uk, .fr, .pe.
Currently, there are 13 root name servers specified, with logical names in the form letter.root-servers.net, where letter ranges from A to M and represent companies like Verisign, University of Maryland, NASA, and The Internet Corporation for Assigned Names and Numbers (ICANN).

Trooper_ish · August 2, 2022, 7:53am

So why avoid them?
Do they lack DOH/DOT?

TimHolus · August 2, 2022, 11:30am

Is that a question for me?

Avoid root servers??? The hierarchy works a little differently and I would not define it in the matter of avoiding or not avoiding because it is above the level of concern for user.

The root servers, like most authoritative servers, are configured to not do recursive resolution, which is what your ISPs DNS servers are set up for.

Theres a good reason for that: Back in 2001, when OReilly published DNS and BIND, 4th Edition, they were already handling thousand of queries per second. I cant imagine that they handling less traffic now.

So, theyll instead cheerfully tell you which DNS servers to ask next, expecting you to iteratively ask one DNS server after another in the chain, until you get an answer from the one who actually holds the informationor can tell you definitively no such thing exists. This is what your ISPs servers are doing behind the scenes, when your PC asks them whats the IP address of www.microsoft.com?