From 5c226b3bca517b198fed4af10597c803554944e8 Mon Sep 17 00:00:00 2001 From: Dan Schaper Date: Tue, 23 Jun 2020 10:32:49 -0700 Subject: [PATCH] Destroyed Customising Sources for Ad Lists (markdown) --- Customising-Sources-for-Ad-Lists.md | 73 ----------------------------- 1 file changed, 73 deletions(-) delete mode 100644 Customising-Sources-for-Ad-Lists.md diff --git a/Customising-Sources-for-Ad-Lists.md b/Customising-Sources-for-Ad-Lists.md deleted file mode 100644 index d522703..0000000 --- a/Customising-Sources-for-Ad-Lists.md +++ /dev/null @@ -1,73 +0,0 @@ -# TO BE REMOVED - -## Pi-hole's Default Block Lists - ->**UPDATE:** As of version 3.0, adlist management can now be done via the web UI, you no longer need to touch the command line for this! You can find it under **Settings > Pi-Hole's Block Lists**. -> ->![](https://i.imgur.com/YD2k12x.png) - - -By default, when `pihole -g` pulls in lists of domains to block, we combine several lists, which are defined in [`/etc/pihole/adlists.list`](https://github.com/pi-hole/pi-hole/blob/master/adlists.default): - -```text -# The below list amalgamates several lists we used previously. -# See `https://github.com/StevenBlack/hosts` for details -##StevenBlack's list -https://raw.githubusercontent.com/StevenBlack/hosts/master/hosts - -##MalwareDomains -https://mirror1.malwaredomains.com/files/justdomains - -##Cameleon -http://sysctl.org/cameleon/hosts - -##Zeustracker -https://zeustracker.abuse.ch/blocklist.php?download=domainblocklist - -##Disconnect.me Tracking -https://s3.amazonaws.com/lists.disconnect.me/simple_tracking.txt - -##Disconnect.me Ads -https://s3.amazonaws.com/lists.disconnect.me/simple_ad.txt - -##Hosts-file.net -https://hosts-file.net/ad_servers.txt -``` - -### How To Parse A List To Get Just The Domain - -Imagine you found a list you want to use, but it is formatted with a bunch of extra characters: - -``` -||unlimited-hacks.net^ -||pakcircles.com^ -||cracksplay.com^ -||fbgamecheatz.info^ -||linkz.it^ -``` - -You can use [`sed`](https://linux.die.net/man/1/sed) and/or [`awk`](https://linux.die.net/man/1/awk) (or other commands) to remove the extra characters to get just the domain name. It helps to be familiar with scripting, but if you wanted to parse down the list above, you could do something like this: -``` -curl -s http://some.list | sed 's/^||//' -``` -This would remove the two pipes at the beginning of the lines, so your list would then look like this: -``` -unlimited-hacks.net^ -pakcircles.com^ -cracksplay.com^ -fbgamecheatz.info^ -linkz.it^ -``` -Then, you could use `sed` again, or even something like [`cut`](https://linux.die.net/man/1/cut). Since the domains won't have a carat in the name, you can use it as a delimiter with the `cut` command to display only the domain name. -``` -curl -s http://some.list | sed 's/^||//' | cut -d'^' -f-1 -``` -Which leaves you with just the domain names: -``` -unlimited-hacks.net -pakcircles.com -cracksplay.com -fbgamecheatz.info -linkz.it -``` -There is more than one way to parse the list down and there is no right way, however, some methods are faster. If you can combine most of your parsing into a single `awk` command, it can process a large list much faster. For each `|` that you use in the command, you are slowing down the processing as it is running in another subshell. \ No newline at end of file