Website
.
Fuselkönig.de
.
Projects
.
Status
index
:
fk_crawler
master
The crawler which powers kategorischeraperitif.de/angebote
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
crawler
Age
Commit message (
Expand
)
Author
Files
Lines
2019-05-14
Refactores logging functions. (crawler)
Maximilian Möhring
13
-102
/
+101
2019-05-14
Changes imports to lower case. (crawler)
Maximilian Möhring
10
-10
/
+10
2019-05-12
Minor fix. (crawler)
Max
1
-0
/
+7
2019-05-10
Improves edge case in spirit type detecting. (crawler)
horus
1
-0
/
+8
2019-05-07
Improves detection of different spirit types and removes a lot of dead code.
horus
2
-126
/
+89
2019-05-07
Better spirit type detection. (crawler)
horus
3
-8
/
+117
2019-04-16
Rewrites crawler for drankdozijn. (crawler)
horus
1
-280
/
+131
2019-01-12
Better handling for nil interfaces. (crawler)
horus
1
-26
/
+36
2019-01-11
Crawler for Drankdozijn now uses the official API. (crawler)
Max
1
-160
/
+365
2018-09-16
Enhances crawler for Rum & Co. (crawler)
horus
1
-7
/
+46
2018-09-16
Fix crawler for Rum & Co. (crawler)
horus
1
-3
/
+7
2018-09-16
Fix champagne offerings for Drankdozijn. (crawler)
horus
1
-2
/
+2
2018-09-16
Fix crawler for Drankdozijn. (crawler)
horus
1
-2
/
+14
2018-09-16
Rename flags. Adds new flag to exclude shops. (crawler)
horus
3
-12
/
+28
2018-09-16
Bug fix. (crawler)
horus
1
-1
/
+2
2018-07-09
Improves name sanitizing. (crawler)
Max
1
-8
/
+16
2018-06-16
Corrects short url. (crawler)
horus
1
-1
/
+1
2018-06-16
Bugfix. (crawler)
horus
1
-3
/
+14
2018-06-16
Adds champagner / Drankdozijn. (crawler)
horus
1
-2
/
+8
2018-06-16
Adds scraper for Drankdozijn. (crawler)
horus
3
-0
/
+202
2018-06-16
Removes validating abv based of spirit type. (crawler)
horus
1
-12
/
+14
2018-06-16
Detects cl in sanitize_name(). (crawler)
horus
1
-1
/
+1
2018-06-16
Improves sanitizing function. (crawler)
horus
1
-2
/
+8
2018-06-16
Adds support for cl. (crawler)
horus
1
-3
/
+25
2018-06-15
Removes unnecessary code. (crawler)
horus
2
-9
/
+0
2018-06-15
Introduces config for user agent, robots.txt and crawler delay. (crawler)
horus
12
-26
/
+109
2018-06-15
Introduces central crawler config. (crawler)
Max
9
-32
/
+14
2018-06-15
Improves debugging output. (crawler)
Max
1
-0
/
+1
2018-06-15
Tries to validate image url by making head request. (crawler)
horus
1
-0
/
+26
2018-06-15
Fix because changed html. (crawler)
horus
2
-1
/
+5
2018-05-23
Reflects changes in TWS url structure. (crawler)
horus
1
-1
/
+1
2018-05-23
Repairs --list-shops flag. (crawler)
horus
1
-1
/
+1
2018-05-19
Bugfix. (crawler)
horus
1
-1
/
+1
2018-05-19
Bugfix. (crawler)
horus
2
-2
/
+3
2018-05-14
Renames command line flags. Suppress unhelpful error message. (crawler)
Max
1
-4
/
+8
2018-05-14
Adds command line support to crawl only specific shops. (crawler)
horus
1
-0
/
+5
2018-05-14
Bugfix. (crawler)
horus
1
-1
/
+1
2018-05-14
Add feature to list all crawlable shops. (crawler)
Max
2
-0
/
+20
2018-05-14
Various fix, e.g. it repairs wrong image urls. (crawler)
horus
7
-20
/
+49
2018-02-21
Changes shop name from Whiskysite.nl to Whiskysite. (crawler)
horus
1
-1
/
+1
2018-02-21
Adds some docu. (crawler)
horus
1
-4
/
+7
2018-02-21
Prevents panic when passing a non-existent path as config file. (crawler)
horus
1
-2
/
+7
2018-02-20
Reintroduces debug as a config setting. (crawler)
horus
2
-9
/
+19
2018-02-20
Refactoring + adds a more granular log level setting. (crawler)
horus
5
-54
/
+78
2018-02-20
Removes Tx, because I get 'busy buffer' error. (crawler)
horus
3
-32
/
+12
2018-02-20
Better error reporting. (crawler)
horus
1
-2
/
+3
2018-02-20
Bugfix. (crawler)
horus
3
-14
/
+15
2018-02-20
Bugfix. (crawler)
horus
2
-5
/
+10
2018-02-19
Better error reporting. (crawler)
horus
1
-4
/
+9
2018-02-19
Less Fatal(), more Warn(). (crawler)
horus
3
-4
/
+8
[next]