pictuga
a0f2e0d995
2to3: crawler.py improve except
2015-02-25 18:07:09 +08:00
pictuga
6a06b742f9
2to3: crawler.py port try as
2015-02-25 18:03:54 +08:00
pictuga
c2d85e2bf9
2to3: crawler.py port httplib
2015-02-25 18:02:29 +08:00
pictuga
4f224888d8
2to3: crawler.py port urllib2 and StringIO
2015-02-25 17:53:36 +08:00
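A port like the one above is commonly handled with a guarded import, so the same file runs on both interpreters. A minimal sketch follows; the alias `request` and the variable names are illustrative assumptions, not necessarily what crawler.py does:

```python
try:
    # Python 2 module names
    from StringIO import StringIO
    import urllib2 as request
except ImportError:
    # Python 3 equivalents
    from io import StringIO
    import urllib.request as request

# Both branches expose the same names afterwards
buf = StringIO(u"hello")
content = buf.read()
opener = request.build_opener()
```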
pictuga
27cf8f6498
2to3: (iter)items to list
2015-02-25 12:02:53 +08:00
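Background for this kind of change: on Python 3, `dict.items()` returns a live view rather than a list, so code that modifies the dict while looping needs an explicit `list()` snapshot. A small sketch with made-up data (not taken from morss):

```python
counts = {"a": 1, "b": 0, "c": 2}

# list() takes a snapshot of the items, so deleting keys inside the loop is safe.
# Without it, Python 3 raises "dictionary changed size during iteration".
for key, value in list(counts.items()):
    if value == 0:
        del counts[key]
```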
pictuga
3fb90cb7b4
2to3: local import
2015-02-25 11:57:10 +08:00
pictuga
47c8a511ff
2to3: prints
2015-02-25 11:57:10 +08:00
pictuga
7a66115012
README: use py -m morss.morss
...
Needed for 2to3 relative imports
2015-02-25 11:56:34 +08:00
pictuga
604b03e2ba
Delete desc when :keep=False
...
Still needed for Firefox, because an empty <desc/> still shows up instead of the content in the feed preview
2015-02-24 00:38:34 +08:00
pictuga
83ed440e67
Fix issue when desc and content empty
...
Wouldn't put fetched article in feed
2015-02-24 00:38:02 +08:00
pictuga
5c23f90f0b
Disable options filtering by default
...
But still provide sample code
2015-02-21 02:01:32 +08:00
pictuga
149117029c
Improve logging of fetching errors
2015-02-21 01:58:45 +08:00
pictuga
d5269964fc
Make :theforce also bypass http errors
2015-02-21 01:58:16 +08:00
pictuga
f0dcb9912e
Fix cached errors handling
2015-02-21 01:57:33 +08:00
pictuga
f62aedda12
Double HTTP timeout
...
Better slow than nothing (especially when running on a personal computer)
2015-02-21 01:55:53 +08:00
pictuga
76c4211a04
Make :hungry more useful
2015-02-21 01:55:25 +08:00
pictuga
446dd9fb3f
Fix typo in FeedListDescriptor
...
Thanks @tehsphinx. Fixes #4.
2015-02-20 17:41:14 +08:00
pictuga
ef946c0712
XML pretty-print in separate option
...
Who reads plain XML anyway?
2015-02-20 17:38:39 +08:00
pictuga
fcf4197801
Populate __init__.py
2015-02-19 13:05:59 +08:00
pictuga
ec5f5b865f
Make it easy to restrict available options
2014-11-21 22:01:03 +01:00
pictuga
105ca67744
Move facebook token to own script
...
To a PHP script actually. Not sure why PHP. Keeps morss' code cleaner. This piece of code did not belong there and brought no advantage.
2014-11-19 20:09:27 +01:00
pictuga
a9654ea578
Fix encoding detection in feedify
2014-11-19 12:25:18 +01:00
pictuga
8131ea2244
HTTPS SSL certificate validation
...
Specific error message added
2014-11-19 11:59:59 +01:00
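For illustration, certificate validation can be plugged into a urllib opener as a handler. This is a sketch under assumptions: it relies on `ssl.create_default_context()`, which needs Python 2.7.9 or 3.4 and later, and it is not necessarily the mechanism this commit used:

```python
import ssl

try:
    from urllib.request import HTTPSHandler, build_opener
except ImportError:
    # Python 2 fallback
    from urllib2 import HTTPSHandler, build_opener

# The default context loads the system CA certificates and turns on
# both certificate verification and hostname checking.
context = ssl.create_default_context()

# Requests made through this opener fail on invalid certificates.
opener = build_opener(HTTPSHandler(context=context))
```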
pictuga
1b26c5f0e3
Split SimpleDownload into several Handlers
...
Cleaner code, easier to edit, more flexibility. Paves the way to SSL certificates validation.
Still have to clean up the code of AcceptHeadersHandler.
2014-11-19 11:57:40 +01:00
pictuga
f46576168a
Add :mono to disable multithreading
...
Convenient to have linear logging
2014-11-10 23:14:54 +01:00
pictuga
5dd262139d
Add HTTP error code to download error message
2014-11-09 15:45:01 +01:00
pictuga
6d5bb2b3c5
Print error message in wsgi mode
2014-11-09 15:44:42 +01:00
pictuga
a820cf6812
Run :strip in After
...
Makes more sense
2014-11-09 15:01:50 +01:00
pictuga
607df4b123
Fix Twitter
...
They changed the html structure of the profile pages
2014-11-09 15:00:38 +01:00
pictuga
5eefe2c916
Log more when using wsgi
2014-11-08 21:22:34 +01:00
pictuga
6f2061ff37
Fix :smart
...
Wasn't using the right way
2014-11-08 21:22:07 +01:00
pictuga
c5e3aa664a
Update README after After split
2014-11-08 20:32:34 +01:00
pictuga
40834eeb93
Split After into Before/After
...
Needed since a bunch of options have to run before the actual fetching (because no one needs to fetch the articles of to-be-dropped items)
2014-11-08 20:31:29 +01:00
pictuga
f20fb9cdf6
Use more stable loop-over-list in Gather
2014-11-08 20:30:36 +01:00
pictuga
6a40731248
Return output when DEBUG is on
...
Much more convenient to actually debug
2014-11-07 18:44:59 +01:00
pictuga
dedb7fa5a4
Add :smart to README
2014-11-07 18:42:57 +01:00
pictuga
d3eb2dd88d
Implement :smart to save bandwidth
2014-11-07 18:40:44 +01:00
pictuga
aed2a9bdd3
Add new options to README
2014-11-07 10:20:33 +01:00
pictuga
67fc5f06f8
Run "After" even when debug mode is on
2014-11-06 21:15:16 +01:00
pictuga
ad2673f474
Add :empty to remove all items
...
This is completely useless...
2014-11-06 21:14:41 +01:00
pictuga
ecfda1d05a
Add :strip to remove desc and content
2014-11-06 21:14:20 +01:00
pictuga
1a8ee716f3
Add "search" option
...
PLEASE NOTE that this is case sensitive and does a really basic search ("is xyz in the title?"). Don't use this for fine filtering.
Also fixed an issue with After(): some functions were removing items from the feed while looping over the feed items, which caused annoying item-skipping issues.
2014-11-06 21:11:23 +01:00
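The item-skipping issue described above is a general Python pitfall: removing elements from a list while iterating over it shifts the remaining elements, so the loop skips whichever one follows each removal. A minimal sketch, using a plain list of titles as a stand-in for feed items (the function names are illustrative, not morss code):

```python
def drop_buggy(items, word):
    # Mutates the list being iterated: the element after each removal is skipped.
    for item in items:
        if word in item:
            items.remove(item)
    return items


def drop_safe(items, word):
    # Iterates over a copy, so removals do not disturb the loop.
    for item in list(items):
        if word in item:
            items.remove(item)
    return items


titles = ["ad one", "ad two", "news", "ad three"]
buggy = drop_buggy(list(titles), "ad")  # "ad two" survives by accident
safe = drop_safe(list(titles), "ad")    # only "news" is left
```

The `word in item` test is the same case-sensitive substring check the commit message warns about.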
pictuga
690bf43977
reader: show desc if no content is available
2014-10-26 19:22:57 +01:00
pictuga
0e22bb4316
Cache: catch json parse errors
2014-09-28 12:03:58 +02:00
pictuga
5f8288eecb
Add :hungry to fill feeds with long intros
2014-06-28 01:43:31 +02:00
pictuga
ac69b28f1b
Pass options to Fill
2014-06-28 01:43:09 +02:00
pictuga
6cc3e7eb93
Fix :callback and add content-type
2014-06-28 01:20:47 +02:00
pictuga
0ec7c2f3e6
Fix :callback crash
2014-06-28 01:13:29 +02:00
pictuga
484432d804
Add :callback for JSONP calls
2014-06-28 00:59:57 +02:00
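For readers unfamiliar with JSONP: the server wraps its JSON payload in a call to a function named by the client, so the response can be loaded through a script tag. A rough sketch, where `wrap_jsonp` is a hypothetical helper and not the function used in morss:

```python
import json
import re


def wrap_jsonp(payload, callback):
    # Hypothetical helper, not morss code.
    # Only accept plain identifiers, so the callback cannot inject script.
    if not re.match(r"[A-Za-z_][A-Za-z0-9_]*\Z", callback):
        raise ValueError("invalid callback name")
    return "%s(%s)" % (callback, json.dumps(payload))


body = wrap_jsonp({"title": "feed"}, "handle")
```

Such a response is normally served with a JavaScript content type rather than a JSON one.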
pictuga
226441d821
Add :cors for cross-domain XHR (with README update)
2014-06-28 00:59:13 +02:00
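Cross-domain XHR comes down to one response header. The sketch below shows it in a bare WSGI app; the app itself and the wildcard origin are assumptions for illustration, not the actual morss code:

```python
def app(environ, start_response):
    # Hypothetical minimal WSGI app.
    # The wildcard origin lets a page on any domain read the response via XHR.
    headers = [
        ("Content-Type", "text/xml"),
        ("Access-Control-Allow-Origin", "*"),
    ]
    start_response("200 OK", headers)
    return [b"<rss/>"]
```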