Age | Commit message | Author | Files | Lines
5 hours | feat: add license_short from Wikipedia license identifier [main] | dev | 2 | -3/+8
5 hours | fix: only count awards tables in num_accolades | dev | 1 | -0/+18
9 hours | feat: add three-level logging with per-request debug output | dev | 5 | -47/+135
9 hours | chore: sync schema.sql with live database | dev | 1 | -1/+9
10 hours | fix: add nil checks in extractPeople for missing infobox/section data | dev | 1 | -14/+19
10 hours | feat: extract actors, directors, screenwriters from Wikipedia API | dev | 2 | -7/+184
19 hours | fix: prevent dropped wiki entries when channel fills | dev | 1 | -13/+4
20 hours | feat: add -wiki-only flag to rerun only wiki data extraction | dev | 2 | -11/+46
20 hours | refactor: pipeline SPARQL and wiki data in parallel | dev | 3 | -166/+169
20 hours | . | dev | 2 | -8/+1
21 hours | refactor: decode wiki_article names once in DB, encode on send | dev | 1 | -4/+0
21 hours | fix: decode wiki article names for clean storage | dev | 2 | -4/+7
21 hours | fix: avoid double URL-encoding of wiki article names | dev | 1 | -4/+11
22 hours | feat: track wiki_status_code and skip 404 entries on rerun | dev | 1 | -23/+47
26 hours | fix: add 429 retry with exponential backoff and increase rate limit delay | dev | 1 | -9/+32
26 hours | feat: fetch missing wiki data from custom server and populate imdb table | dev | 3 | -0/+291
26 hours | fix: skip already-classified entries in wikidata query | dev | 1 | -1/+1
27 hours | feat: set has_no_wiki_article flag for entries without Wikipedia article | dev | 1 | -13/+34
27 hours | feat: fetch Wikipedia article titles via Wikidata SPARQL | dev | 3 | -1/+243
3 days | fix: use INSERT IGNORE for imdb_genre to handle re-runs | dev | 1 | -1/+1
3 days | feat: adapt genre code for n:m relation via imdb_genre | dev | 2 | -15/+73
3 days | feat: populate genre table from title.basics.tsv | dev | 1 | -11/+47
3 days | fix: correct TSV parsing — use line-by-line reader and proper column indices | dev | 1 | -30/+57
3 days | chore: delete .gz files after extracting in downloadImdbDatasets | dev | 1 | -0/+3
3 days | move download path | dev | 1 | -1/+1
3 days | feat: fetchAndUpdateImdbData — download IMDB datasets and populate imdb table | dev | 2 | -0/+352
3 days | feat: populate imdb table with unique title IDs from links | dev | 1 | -0/+91
3 days | feat: extract IMDB title IDs from links URLs into param field | dev | 3 | -15/+87
3 days | feat: add AccessToken back to Config struct (json:"-" to exclude from seriali... | dev | 1 | -0/+1
3 days | chore: remove access_token from config (calculated by program) | dev | 2 | -4/+0
3 days | feat: switch config to JSON; add go.mod and config.json.example | dev | 4 | -88/+57
3 days | chore: commit existing config.go changes | dev | 1 | -1/+2
3 days | Changes to config.go | dev | 1 | -0/+1
3 days | Initial commit | dev | 5 | -0/+259