Commit Graph

42 Commits

Author SHA1 Message Date
dgtlmoon
f215adbbe5 CSS Filter - Smarter is to just extract the HTML blob and continue with inscriptis, so we have almost the same output as not using the filter 2021-06-23 20:40:01 +10:00
dgtlmoon
8d59ef2e10 CSS Filter - restore nicer linefeeds 2021-06-23 12:52:04 +10:00
dgtlmoon
e3a9847f74 @todo Comment - BS4's element.get_text() seems to lose the indentation format no matter what 2021-06-23 12:49:53 +10:00
dgtlmoon
47f7698b32 CSS Filter - strip text of whitespace, preserve new lines where applicable, remove extra newlines 2021-06-23 12:29:14 +10:00
dgtlmoon
854520005d #81 - Regex support (#90)
* Re #81 - Regex support
* minor cleanup
2021-06-21 17:17:22 +10:00
Leonardo Brondani Schenkel
cec45a7ad7 Strip surrounding whitespace from elements (#89) 2021-06-16 13:57:22 +10:00
dgtlmoon
2346b42ef2 CSS selector filter (#73)
* Re #9 CSS Selector filtering, Adding test for #9
2021-05-30 21:22:26 +10:00
Leigh Morresi
e0578acca2 Tidy up thread logic and version check 2021-03-01 14:29:21 +01:00
Leigh Morresi
47fcb8b4f8 Move logic 2021-02-27 22:01:42 +01:00
Leigh Morresi
f1da8f96b6 When new ignore text is specified, reprocess the checksum 2021-02-27 20:30:06 +01:00
Leigh Morresi
468184bc3a Issue #14 - Tweaks to edit, create ignore text, tests for ignore text, integrate ignore text 2021-02-26 20:07:26 +01:00
Leigh Morresi
96221598e7 Tidy up return logic 2021-02-21 20:23:50 +01:00
Leigh Morresi
e200cd3289 Fixing a few more easy lint wins 2021-02-21 14:26:19 +01:00
Leigh Morresi
63eea2d6db Linting fixups 2021-02-21 14:21:14 +01:00
Leigh Morresi
b0c5dbd88e Just use the current/previous md5 2021-02-21 13:46:16 +01:00
Leigh Morresi
1718e2e86f Finalise pytest methods 2021-02-21 13:41:00 +01:00
Leigh Morresi
87f4347fe5 hack of pytest implementation - doesn't work yet 2021-02-16 21:35:28 +01:00
Leigh Morresi
93ee65fe53 Tidy up a few broken datastore paths 2021-02-12 19:43:05 +01:00
Leigh Morresi
9f964b6d3f WIP, separate out the Flask from everything else, get pytest working 2021-02-12 19:24:30 +01:00
Leigh Morresi
47e5a7cf09 Avoid accidentally using Python's objects that are copied - but land as a 'soft reference', need to use a better dict struct in the future #6 2021-02-05 18:43:35 +01:00
Leigh Morresi
d07cf53a07 Minor fix to 'last changed' field, simplify template and logic 2021-02-04 13:15:39 +01:00
Leigh Morresi
5e31ae86d0 Use a thread locker and cleaner separation of concerns between main thread and site status fetch 2021-02-04 12:38:48 +01:00
Leigh Morresi
07f41782c0 Adding SEND_FILE_MAX_AGE_DEFAULT to ensure backups etc don't get old 2021-02-03 09:45:58 +01:00
Leigh Morresi
f1c2ece32f Use a pool of thread workers, better for huge lists of watchers 2021-02-02 16:29:06 +01:00
Leigh Morresi
eecc620386 https://github.com/psf/requests/issues/4525 - brotli compression is not yet supported in requests, be sure that users can't accidentally use this content type encoding in the headers 2021-02-02 11:49:43 +01:00
Leigh Morresi
81534d9367 Add [diff] mechanism 2021-01-31 19:55:35 +01:00
Leigh Morresi
43c7ccb3fe Use a single thread for writing the sync json 2021-01-31 18:49:14 +01:00
Leigh Morresi
bfcb17ca24 Remove import for old lib 2021-01-30 10:29:39 +01:00
Leigh Morresi
98f6f4619f Switch to inscriptis
prepare config backend struct
2021-01-30 10:14:19 +01:00
Leigh Morresi
fbe20d45cc Support for custom headers per watch 2021-01-29 19:12:39 +01:00
Leigh Morresi
324c54fe46 Use requests's r.text so we don't have to deal with charsets 2021-01-29 13:05:31 +01:00
Leigh Morresi
b7a0c2dbcd Add edit UI
Move to keyed structure instead of list
2021-01-29 10:49:05 +01:00
Leigh Morresi
9c0c8bf6aa Remove actual :// links, don't consider these as part of the changes, often they include variables/tracking script ref etc 2021-01-28 14:45:01 +01:00
Leigh Morresi
b574a28f1f Tweak comments 2021-01-28 11:12:42 +01:00
Leigh Morresi
01359e4811 Store a history of changes, used for future lookup/diff/explore changes UI 2021-01-28 10:39:14 +01:00
Leigh Morresi
93562afb02 Adding README and docker info 2021-01-27 20:51:55 +01:00
Leigh Morresi
f455f14efd Primitive support for extra headers 2021-01-27 20:09:40 +01:00
Leigh Morresi
a4f1f6ab69 Handle titles and links 2021-01-27 19:54:17 +01:00
Leigh Morresi
1968d400fe Store the html2text version too 2021-01-27 19:04:39 +01:00
Leigh Morresi
0515aca7dd small fixes 2021-01-27 17:35:32 +01:00
Leigh Morresi
646a54945a Handle errors better, use the plaintext output 2021-01-27 15:56:59 +01:00
Leigh Morresi
2f018ac04c Work on threads 2021-01-27 15:12:31 +01:00