1026 Commits

Author SHA1 Message Date
bfaf7b0fac feeds: clean up default item_link
To be supported by feeds' `_rule_parse`
2022-01-24 16:16:14 +00:00
32d9bc9d9d feeds: proceed with conversion when rules do not match 2022-01-24 09:34:57 +00:00
b138f11771 util: support more data_files location 2022-01-23 12:40:18 +01:00
a01258700d More ordering options 2022-01-23 12:27:07 +01:00
4d6d3c9239 wsgi: limit supported mimetypes & return actual mimetype 2022-01-23 11:44:07 +01:00
e81f6b173f readabilite: remove code duplicate 2022-01-23 11:41:32 +01:00
fe5dbf1ce0 wsgi: reuse mimetype table from crawler 2022-01-22 13:22:39 +01:00
fdf9acd32b helper: fix reload code 2022-01-19 13:44:15 +01:00
d05706e056 crawler: fix typo 2022-01-19 13:41:12 +01:00
e88a823ada feeds: better handle rulesets without a 'mode' specified 2022-01-19 13:08:33 +01:00
750850c162 crawler: avoid too many .append() 2022-01-19 13:04:33 +01:00
c8669002e4 feeds: exotic xpath in html as well 2022-01-17 14:22:48 +00:00
c524e54d2d feeds: support some exotic xpath rules returning a single string 2022-01-17 13:59:58 +00:00
ef14567d87 Handle morss-helper with setup.py 2022-01-08 16:10:51 +01:00
fb643f5ef1 readabilite: remove unneeded reference to features (overriden by builder) 2022-01-03 18:01:12 +00:00
dbdca910d8 readabilite: fix new parser code & drop PIs 2022-01-03 17:51:49 +00:00
9eb19fac04 readabilite: use custom html parser within bs4's lxml parser
Solves the following obscure error:
ValueError: Invalid PI name 'b'xml''
2022-01-03 16:26:17 +00:00
d424e394d1 readabilite: use lxml bs4 parser for speed 2022-01-01 14:52:48 +01:00
3f92787b38 readabilite: limit html comments related issues 2022-01-01 13:58:42 +01:00
afc31eb6e9 readabilite: avoid double parsing of html 2022-01-01 12:51:30 +01:00
87d2fe772d wsgi: fix py2 compatibility 2022-01-01 12:35:41 +01:00
917aa0fbc5 crawler: do not re-save cached response
Otherwise cache never gets invalidated!
2021-12-31 19:28:11 +01:00
3e2b81286f xsl: add link to favicon
To limit error output when failing to fetch favicon.ico
2021-12-31 19:25:53 +01:00
15430a2b83 helper: restore run if no param passed 2021-12-29 23:35:16 +01:00
ecdb74812d Make helper & main.py executable 2021-12-29 15:47:05 +01:00
2c7844942c drone: re order deploy commands 2021-12-29 15:41:29 +01:00
e12cb4567a helper: more debug options 2021-12-29 15:41:03 +01:00
b74365b121 Make helper more posix compliant 2021-12-29 15:40:43 +01:00
2020543469 Make morss-helper executable 2021-12-29 15:37:12 +01:00
676be4a4fe helper: work around for systems only having py3 binary 2021-12-29 14:07:12 +01:00
8870400a6e Clean up morss-helper 2021-12-28 16:30:20 +01:00
8e9cc541b0 Turns out exec array is not supported in HEALTHCHECK 2021-12-28 15:23:40 +01:00
2a7a1b83ec Use alpine:edge to have up-to-date py packages 2021-12-28 13:41:42 +01:00
106f59afa1 docker: shift HEALTHCHECK to helper 2021-12-27 16:08:55 +01:00
ee514e2da3 helper: remove unneeded sudo 2021-12-25 22:00:22 +00:00
e7578e859a Clean up install/exec 2021-12-25 18:21:55 +01:00
3bcb8db974 Improve cloud-init (append & env var) 2021-12-25 11:02:27 +01:00
7751792942 Shift htaccess to README 2021-12-24 18:03:55 +01:00
6e2e5ffa00 README: cloud-init indication for env var 2021-12-24 11:44:56 +01:00
f6da0e1e9b Make use of GUNICORN_CMD_ARGS 2021-12-24 11:44:24 +01:00
2247ba13c5 drone: clean up file 2021-12-23 12:03:50 +01:00
d17b9a2f27 Fix typo in DISKCACHE_DIR var name 2021-12-23 12:02:24 +01:00
5ab45e60af README: scale back on logos 2021-12-16 09:03:43 +00:00
368e4683d6 util: clean paths code 2021-12-16 08:53:18 +00:00
9fd8c7d6af drone: add back install command on deploy
Was lost on the way
2021-12-14 15:42:02 +00:00
89f5d07408 drone: use docker for ssh
ssh pipelines require a separate runner
2021-12-14 15:33:38 +00:00
495bd44893 drone: escape full command 2021-12-14 15:16:21 +00:00
ff12dbea39 drone: escape $ sign 2021-12-14 15:12:22 +00:00
7885ab48df drone: deploy 2021-12-14 15:10:46 +00:00
7cdcbd23e1 wsgi: fix another typo 2021-12-14 12:06:08 +00:00