| 
						
					 | 
					
						
						
							
						
						bd0efb1529
					 | 
					
						
						
							
							crawler: missing os import
						
						
						
						
						
						
					 | 
					
						2020-08-23 18:45:44 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						47a17614ef
					 | 
					
						
						
							
							Rename morss/cgi.py into morss/wsgi.py
						
						
						
						
						
						
						
						To avoid name collision with the built-in cgi lib 
						
						
					 | 
					
						2020-08-23 18:44:49 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						4dfebe78f7
					 | 
					
						
						
							
							Pick caching backend via env vars
						
						
						
						
						
						
					 | 
					
						2020-08-23 18:43:18 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						dcd3e4a675
					 | 
					
						
						
							
							cgi.py: add missing impots
						
						
						
						
						
						
					 | 
					
						2020-08-23 18:31:05 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						e968b2ea7f
					 | 
					
						
						
							
							Remove leftover :debug code
						
						
						
						
						
						
					 | 
					
						2020-08-23 16:59:34 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						0ac590c798
					 | 
					
						
						
							
							Set MAX_/LIM_* settings via env var
						
						
						
						
						
						
					 | 
					
						2020-08-23 16:09:58 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						fa1b5aef09
					 | 
					
						
						
							
							Instructions for DEBUG= use
						
						
						
						
						
						
					 | 
					
						2020-08-23 15:31:11 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						7f6309f618
					 | 
					
						
						
							
							README: :silent was explained twice
						
						
						
						
						
						
					 | 
					
						2020-08-23 14:34:04 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						f65fb45030
					 | 
					
						
						
							
							:debug completely deprecated in favour of DEBUG=
						
						
						
						
						
						
					 | 
					
						2020-08-23 14:33:32 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						6dd40e5cc4
					 | 
					
						
						
							
							cli.py: fix Options code
						
						
						
						
						
						
					 | 
					
						2020-08-23 14:25:09 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						0acfce5a22
					 | 
					
						
						
							
							cli.py: remove log
						
						
						
						
						
						
					 | 
					
						2020-08-23 14:24:57 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						97ccc15db0
					 | 
					
						
						
							
							cgi.py: rename parseOptions to parse_options
						
						
						
						
						
						
					 | 
					
						2020-08-23 14:24:23 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						7a560181f7
					 | 
					
						
						
							
							Use env var for DEBUG
						
						
						
						
						
						
					 | 
					
						2020-08-23 14:23:45 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						baccd3b22b
					 | 
					
						
						
							
							Move parseOptions to cgi.py
						
						
						
						
						
						
						
						As it is no longer used in cli.py 
						
						
					 | 
					
						2020-08-22 00:37:34 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						f79938ab11
					 | 
					
						
						
							
							Add :silent to readme & argparse
						
						
						
						
						
						
					 | 
					
						2020-08-22 00:02:08 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						5b8bd47829
					 | 
					
						
						
							
							cli.py: remove draft code
						
						
						
						
						
						
					 | 
					
						2020-08-21 23:59:12 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						b5b355aa6e
					 | 
					
						
						
							
							readabilite: increase penalty for high link density
						
						
						
						
						
						
					 | 
					
						2020-08-21 23:55:04 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						94097f481a
					 | 
					
						
						
							
							sheet.xsl: better handle some corner cases
						
						
						
						
						
						
					 | 
					
						2020-08-21 23:54:35 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						8161baa7ae
					 | 
					
						
						
							
							sheet.xsl: improve css
						
						
						
						
						
						
					 | 
					
						2020-08-21 23:54:12 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						bd182bcb85
					 | 
					
						
						
							
							Move cli code to argParse
						
						
						
						
						
						
						
						Related code changes (incl. :format=xyz) 
						
						
					 | 
					
						2020-08-21 23:52:56 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						c7c2c5d749
					 | 
					
						
						
							
							Removed unused filterOptions code
						
						
						
						
						
						
					 | 
					
						2020-08-21 23:23:33 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						c6b52e625f
					 | 
					
						
						
							
							split morss.py into __main__/cgi/cli.py
						
						
						
						
						
						
						
						Should hopefully allow cleaner code in the future 
						
						
					 | 
					
						2020-08-21 22:17:55 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						c6d3a0eb53
					 | 
					
						
						
							
							readabilite: clean up code
						
						
						
						
						
						
					 | 
					
						2020-07-15 00:49:34 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						c628ee802c
					 | 
					
						
						
							
							README: add docker-compose instructions
						
						
						
						
						
						
					 | 
					
						2020-07-13 20:50:39 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						6021b912ff
					 | 
					
						
						
							
							morss: fix item removal
						
						
						
						
						
						
						
						Usual issue when editing a list while looping over it 
						
						
					 | 
					
						2020-07-06 19:25:48 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						f18a128ee6
					 | 
					
						
						
							
							Change :first for :newest
						
						
						
						
						
						
						
						i.e. toggle default for the more-obvious option 
						
						
					 | 
					
						2020-07-06 19:25:17 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						64af86c11e
					 | 
					
						
						
							
							crawler: catch html parsing errors
						
						
						
						
						
						
					 | 
					
						2020-07-06 12:25:38 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						15951d228c
					 | 
					
						
						
							
							Add :first to NOT sort items by date
						
						
						
						
						
						
					 | 
					
						2020-07-06 11:39:08 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						c1b1f5f58a
					 | 
					
						
						
							
							morss: restrict iframe use from :get to avoid abuse
						
						
						
						
						
						
					 | 
					
						2020-06-09 12:33:37 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						985185f47f
					 | 
					
						
						
							
							morss: more flexible feed creator auto-detection
						
						
						
						
						
						
					 | 
					
						2020-06-08 13:03:24 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						3190d1ec5a
					 | 
					
						
						
							
							feeds: remove useless if(len) before loop
						
						
						
						
						
						
					 | 
					
						2020-06-02 13:57:45 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						9815794a97
					 | 
					
						
						
							
							sheet.xsl: make text more self explanatory
						
						
						
						
						
						
					 | 
					
						2020-05-27 21:42:00 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						758b6861b9
					 | 
					
						
						
							
							sheet.xsl: fix text alignment
						
						
						
						
						
						
					 | 
					
						2020-05-27 21:36:11 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						ce4cf01aa6
					 | 
					
						
						
							
							crawler: clean up encoding detection code
						
						
						
						
						
						
					 | 
					
						2020-05-27 21:35:24 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						dcfdb75a15
					 | 
					
						
						
							
							crawler: fix chinese encoding support
						
						
						
						
						
						
					 | 
					
						2020-05-27 21:34:43 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						4ccc0dafcd
					 | 
					
						
						
							
							Basic help for sub-lib interactive use
						
						
						
						
						
						
					 | 
					
						2020-05-26 19:34:20 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						2fe3e0b8ee
					 | 
					
						
						
							
							feeds: clean up other stylesheets before putting ours
						
						
						
						
						
						
					 | 
					
						2020-05-26 19:26:36 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						ad3ba9de1a
					 | 
					
						
						
							
							sheet.xsl: add <select/> to use :firstlink
						
						
						
						
						
						
					 | 
					
						2020-05-13 12:33:12 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						68c46a1823
					 | 
					
						
						
							
							morss: remove deprecated twitter/fb link handling
						
						
						
						
						
						
					 | 
					
						2020-05-13 12:31:09 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						91be2d229e
					 | 
					
						
						
							
							morss: ability to use first link from desc instead of default link
						
						
						
						
						
						
					 | 
					
						2020-05-13 12:29:53 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						038f267ea2
					 | 
					
						
						
							
							Rename :theforce into :force
						
						
						
						
						
						
					 | 
					
						2020-05-13 11:49:15 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						22005065e8
					 | 
					
						
						
							
							Use etree.tostring 'method' arg
						
						
						
						
						
						
						
						Gives appropriately formatted html code.
Some pages might otherwise be rendered as blank. 
						
						
					 | 
					
						2020-05-13 11:44:34 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						7d0d416610
					 | 
					
						
						
							
							morss: cache articles for 24hrs
						
						
						
						
						
						
						
						Also make it possible to refetch articles, regardless of cache 
						
						
					 | 
					
						2020-05-12 21:10:31 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						5dac4c69a1
					 | 
					
						
						
							
							crawler: more code comments
						
						
						
						
						
						
					 | 
					
						2020-05-12 20:44:25 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						36e2a1c3fd
					 | 
					
						
						
							
							crawler: increase size limit from 100KiB to 500
						
						
						
						
						
						
						
						I'm looking at you, worldbankgroup.csod.com/ats/careersite/search.aspx 
						
						
					 | 
					
						2020-05-12 19:34:16 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						83dd2925d3
					 | 
					
						
						
							
							readabilite: better parsing
						
						
						
						
						
						
						
						Keeping blank_text keeps the tree more as-it, making the final output closer to expectations 
						
						
					 | 
					
						2020-05-12 14:15:53 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						e09d0abf54
					 | 
					
						
						
							
							morss: remove deprecated peace of code
						
						
						
						
						
						
					 | 
					
						2020-05-07 16:05:30 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						ff26a560cb
					 | 
					
						
						
							
							Shift safari work around to morss.py
						
						
						
						
						
						
					 | 
					
						2020-05-07 16:04:54 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						74d7a1eca2
					 | 
					
						
						
							
							sheet.xsl: fix word wrap
						
						
						
						
						
						
					 | 
					
						2020-05-06 16:58:28 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					| 
						
					 | 
					
						
						
							
						
						eba295cba8
					 | 
					
						
						
							
							sheet.xsl: fixes for safari
						
						
						
						
						
						
					 | 
					
						2020-05-06 12:01:27 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 |