i try to defend google ads click fraud with analyzing how many clicks an user do with one IP-address.
For these i installed goacess on my ubuntu 18.04 lts.
The landing page for google ads traffic is "domain.de/shop/productxy?traffic=ads".
With these filter i see one hit per click - these will be ok:
tail -f access_ssl_log | grep -i --line-buffered 'GET /shop/produkt-kaufen?traffic=ads' | goaccess --log-format=COMBINED
These are stored in the access_log when an user comes to the landing page:
2022-12-20 10:08:59 109.42.114.xx - - "GET /shop/produkt-kaufen?traffic=ads&gclid=EAIaIQobChMI2cLVpOyH_AIV1obVCh1-UwEKEAAYASAAEgKRV_D_BwE HTTP/1.0" 200 12827 "https://www.google.de/" "Mozilla/5.0 (Linux; Android 11; M2102J20SG) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/106.0.5249.126 Mobile Safari/537.36 OPR/72.5.3767.69342"
Now i need to setup an script which can executed daily by cronjob.
These script won´t work (the website are ok, i see statics, but i think i see all data with exclusion from the data i search for):
#!/bin/sh
set -x
cd /projects/stats/
TARGETDIR=/var/www/vhosts/nrw-carcoding.de/nrw-carcoding/analyze
WORKDIR=/projects/stats/work
for DOMAIN in nrw-carcoding.de
do
echo Create Stats for $DOMAIN
LOGS=/var/www/vhosts/nrw-carcoding.de/logs/access_ssl_log
rm $WORKDIR/*.*
cp $LOGS* $WORKDIR
gzip -d work/*.gz
cat work/access_ssl_log.* | \
grep -v "GET /shop/bmw-fussraummodul-frm3-reparatur-e81-e87-e90-e91-e92-kaufen?traffic=ads" | \
goaccess -o /var/www/vhosts/nrw-carcoding.de/nrw-carcoding/analyze/$DOMAIN.html --log-format=COMBINED
done
big thanks.