37 Commits

Author SHA1 Message Date
timmy 260a13688a Wait every 200 downloads so CV doesn't get overloaded 2024-12-26 17:50:25 -08:00
timmy e04469938d Fix downloading images that are not hashed 2024-12-26 16:11:10 -08:00
timmy 8bdfb282b9 Only create directories if we are keeping images 2024-12-25 15:22:37 -08:00
timmy 9baa4dbc17 Add RmdirP 2024-12-25 15:11:11 -08:00
timmy 5cf25089da Improve downloading images 2024-12-25 14:53:56 -08:00
timmy cf2c61b92e Update pre-commit 2024-12-25 14:49:22 -08:00
timmy cc4e973bf9 Make deleting files efficient 2024-12-15 14:15:33 -08:00
timmy 7ede0dee72 Fix file closing 2024-10-31 21:12:53 -05:00
timmy 840e97cff0 Handle reading existing invalid json 2024-10-19 09:50:23 -07:00
timmy 50fcfb9513 Remove downloaded result on json failure 2024-10-19 09:41:42 -07:00
timmy 776ca68e3e Wait for page download failures 2024-10-19 09:39:08 -07:00
timmy 95fa44fb97 Fix mutex on badURLs 2024-10-18 14:35:29 -07:00
timmy d0c7ed792a Fix infinitely increasing url length 2024-10-18 14:30:30 -07:00
timmy 97802d9111 Start cover download immediately 2024-10-18 11:48:06 -07:00
timmy 89c9b4ebce Allow truncating image files to save space 2024-10-18 11:19:50 -07:00
timmy a65cadf106 Add CV cover downloader 2024-10-18 10:50:35 -07:00
timmy 4922ceb678 Fix locking for map storage 2024-10-16 17:56:19 -07:00
timmy da54b3a454 Fix decoding hashes 2024-10-14 02:02:26 -07:00
timmy f560b7f428 Fix loading empty hashes 2024-10-13 22:18:51 -07:00
timmy 87c1a69b49 Remove unused Path attribute 2024-10-13 22:16:37 -07:00
timmy dbf03d258c Fix encoding hashes 2024-10-13 22:14:42 -07:00
timmy 095c78f0e7 Optimize iterating hashes 2024-09-12 11:42:29 -07:00
timmy 0928ed6ccf Optimize memory usage
Add a basic map storage that does manual searches to conserve memory
Change saved hashes format to allow multiple hashes for a given ID
Add a vptree storage

Maps in Go take up a huge amount of space. Changing IDList to []ID took
  memory from over 1GB down to 200MB (note this was on aarch64 macOS,
  which for some reason uses less memory than aarch64 Linux).
  Exhaustive searches using slices took about 30 ms; search now takes
  50-60 ms, as it takes longer to iterate a map. Partial hashes will
  speed up searches to 8 ms at the cost of 700MB initial memory usage
  and 400MB idle (though this is on macOS, which for some reason uses
  less memory than aarch64 Linux, so probably more like
  900MB initial -> 600MB idle on an RPi running Linux).
2024-09-07 14:51:18 -07:00
timmy b1de95021a Add cli flag 2024-09-02 15:35:36 -07:00
timmy 1955444dcf Add sqlite implementation 2024-09-01 18:14:19 -07:00
timmy 0069ffd5cb Make runtime hash storage modular 2024-09-01 18:13:47 -07:00
timmy 007a726764 Remove massive html file 2024-08-11 22:55:14 -07:00
timmy d730de8fe1 Finish POC
Make a pip installable package for quick-tag
Save and load hashes from msgpack/json
Embed hashes of all thumb_urls from ComicVine
Allow associating new IDs to old IDs
Support multiple domains
Support hashing local covers
Support simple list of ids and distance and results including the hash
2024-08-11 20:46:41 -07:00
timmy a9630ac31e Finish implementing quick_tag with simple results 2024-08-10 19:46:53 -07:00
timmy 2cbbaa0d65 Update pre-commit 2024-08-10 19:26:09 -07:00
timmy df2906ac59 Add an option to return simple results 2024-08-05 13:54:00 -07:00
timmy 4a7e61e189 static check fixes 2024-08-04 18:26:43 -07:00
timmy e2082465c6 more stuff 2024-08-04 18:12:00 -07:00
timmy c61ade9961 stuff 2024-07-31 11:35:17 -07:00
timmy 4992f13ac5 add output from initial image hash comparison 2024-05-10 14:12:39 -07:00
timmy 5de0e78ee6 Add debug output for go and python 2024-05-10 13:58:04 -07:00
timmy ce71c2f66b Initial Commit 2024-05-01 18:09:02 -07:00
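
The "Optimize memory usage" commit (0928ed6ccf) mentions exhaustive searches over hashes held in slices. The following is a minimal Go sketch of that kind of search, assuming 64-bit image hashes compared by Hamming distance. The SavedHash and Match types, the Search function, and the sample values are hypothetical illustrations and are not taken from the repository.

```go
package main

import (
	"fmt"
	"math/bits"
	"sort"
)

// SavedHash pairs a 64-bit image hash with the ID it belongs to.
// Hypothetical type: the repository's real layout may differ.
type SavedHash struct {
	Hash uint64
	ID   string
}

// Match is one search result: an ID and its distance from the target.
// Hypothetical type, used only for this sketch.
type Match struct {
	ID       string
	Distance int
}

// Search scans every stored hash (an exhaustive search) and returns the
// entries whose Hamming distance to target is at most maxDistance,
// ordered from closest to farthest.
func Search(hashes []SavedHash, target uint64, maxDistance int) []Match {
	var matches []Match
	for _, h := range hashes {
		// XOR leaves a 1 bit wherever the two hashes differ;
		// counting those bits gives the Hamming distance.
		d := bits.OnesCount64(h.Hash ^ target)
		if d <= maxDistance {
			matches = append(matches, Match{ID: h.ID, Distance: d})
		}
	}
	// Stable sort keeps insertion order among equal distances.
	sort.SliceStable(matches, func(i, j int) bool {
		return matches[i].Distance < matches[j].Distance
	})
	return matches
}

func main() {
	// Sample values made up for illustration.
	hashes := []SavedHash{
		{Hash: 0xF0, ID: "a"},
		{Hash: 0x0F, ID: "c"},
		{Hash: 0xF1, ID: "b"},
	}
	fmt.Println(Search(hashes, 0xF0, 2))
}
```

A flat slice like this stores only the hash and ID for each entry, which is the kind of layout the commit message credits with the memory reduction. The cost is that every search visits every element.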