Commit Graph

549 Commits (8e9cc541b0145c83f779df83991148721d982bec)

Author SHA1 Message Date
pictuga 862fe3cae4 Use more recent user-agent 2014-06-16 14:01:01 +02:00
pictuga 7211093cc5 Add :smart :noref modes, update README 2014-06-16 14:00:02 +02:00
pictuga f991802d9e Try to use less server-specific code for FB tokens 2014-06-16 13:57:53 +02:00
pictuga 9285525256 Unify internal/external errors 2014-06-16 13:55:59 +02:00
pictuga cdef40fbbe Fix Cache saving crash
Because was deleting values of a dict while looping over its values...
2014-06-07 19:14:31 +02:00
pictuga f90958149e Add :reader
Uses wheezy.template, which is said to be fast and light. Provided template file is really basic, custom css suggested.
2014-05-29 14:12:16 +02:00
pictuga b66ac2bc5e Make it possible not to use caching 2014-05-24 19:13:41 +02:00
pictuga 25fdca4bf0 Add do-it-all function
For quick lib use
2014-05-24 19:02:22 +02:00
pictuga 26c91070f5 Time-based Cache
Solves the :proxy issue for good. More convenient, more flexible
2014-05-24 19:01:21 +02:00
pictuga 5e64696031 Fix '/morss.py/' url fixer 2014-05-22 22:53:36 +02:00
pictuga 364fbc4ba6 Remove apparent limit
Cause no longer works, cause of all-bool args introduced earlier
2014-05-22 22:52:49 +02:00
pictuga b03d865b7b Get rid of ParseOptions()
That thing wasn't nice, and depended too much on the various use case. The new approach is to turn morss into a library and turn the use cases into some pre-implemented lib usages
2014-05-22 22:44:59 +02:00
pictuga 3c48c58127 Remove useless HOLD var
Was needed in DEBUG at some point
2014-05-21 12:19:49 +02:00
pictuga e8e7f170a6 Include super dumb http file server
For index.html, other files can be added, but everything has to be hard-coded (mimetype included)
2014-05-18 12:34:23 +02:00
pictuga c41a1fe226 Support for wikipedia fetured articles feed
Should work with most wikipedias
2014-05-18 12:17:14 +02:00
pictuga d8a3c4e9af Add support for Google News 2014-05-18 11:58:45 +02:00
pictuga bbf1ffbb15 Remove 'persistent' and 'dic' arg in Cache
'dic' was mostly intended for facebook now-bygone advanced buggy token storage. 'persistent' was needed by fb and 'proxy' mode, but a small workaround was found for the proxy mode (basically making sure the cache object is always at least 5-item long)
2014-05-15 00:54:40 +02:00
pictuga 76e7f1ea00 Try to use more generic 302/303 redirections
Still far from being great, but at least I can use it on both morss.it and test.morss.it now
2014-05-14 15:05:14 +02:00
pictuga 031b67a8db Remove some useless options
progress and a accidentaly-disclosed one, cause useless
2014-05-14 15:03:40 +02:00
pictuga 974bad7974 Fix and strip down facebook
Remove unstable non-working facebook semi-automatic token renewal (a simple warning on morss.it should be enough). Also commited some forgotten stuff.
2014-05-14 15:01:41 +02:00
pictuga b7136f2056 Pull iTunes raw feed out of iTunes url
This iTunes thinggy somehow qualifies as yet-another-apple-tech-rape: just some old tech behind iron curtains…
2014-05-12 23:15:51 +02:00
pictuga d8074d6b6d Redirect google translate links to original link
Cause anyway Google Translate isn't scrappable. So it's better to have at least some content.
2014-03-22 20:53:33 +01:00
pictuga a4cf5e0daa Google link cleaner now works on all .dot versions 2014-03-22 20:52:25 +01:00
pictuga c94ef92131 Fix Facebook support
Now token is grabbed directly by the server, and sent back by means of a cookie. This does unify token "creation" and renewal.
2014-02-21 14:36:06 +01:00
pictuga a1f5c3db3a Have .csv files be downloaded
So that users can open it in LibreOffice/OpenOffice/Word without having to save it to disk beforehands
2014-02-05 00:37:12 +01:00
pictuga 6c33bb6e1c Safer Cache saving
Create tmp file and then move it to destination. Avoids corrupt files during write
2014-01-29 20:36:45 +01:00
pictuga 6eaec96af7 Keep "dic" param in Cache.new 2014-01-22 15:56:08 +01:00
pictuga 4e549dc88a Change lim/max settings only for current "run" 2014-01-19 23:36:41 +01:00
pictuga 0f7bc568e4 Send CGI HTTP headers earlier
So that browsers show that sth is going on
2014-01-15 21:02:47 +01:00
pictuga 4d6ef92504 Separate function for output. Add csv 2014-01-13 00:10:57 +01:00
pictuga 7fbe728f93 Feeds: allow json, csv export
Uses OrderedDict
2014-01-13 00:08:03 +01:00
pictuga ec55f5e856 Use smarter order for RSS.dict 2014-01-13 00:07:04 +01:00
pictuga 3d78cfb638 Fix HTTP bug when returning empty page 2014-01-11 18:21:37 +01:00
pictuga 840b0b1ded Remove yet another silly log message 2014-01-11 18:18:02 +01:00
pictuga 8209f243bb Fix rss-redirection code
And add log, which was lost when splitting functions (which made this fix needed)
2014-01-11 18:15:36 +01:00
pictuga 3b3ac4c8a6 Remove batch of useless imports 2014-01-11 17:31:27 +01:00
pictuga 5feb061bf7 First attempt at decent folder structure
Use setup.py, subfolder for code.
2014-01-11 17:11:57 +01:00
pictuga 851dacdfbc Renamed to .py. 2013-04-04 18:17:12 +02:00
pictuga 6783bbf992 Improved shebang. 2013-04-04 17:56:37 +02:00
pictuga 82084c2c75 Move to OOP.
This is a huge commit. The whole code is ported to Object-Oritented Programming. This makes the code cleaner, which became required to deal with all the different cases, for example with encoding detection. Encoding detection now works better, and uses 3 different methods. HTML pages with an xml declaration are now supported. Feed urls with parameters (eg. "index.php?option=par") are also supported. Cache is now smarter, since it no longer grows indefinitely, since only in-use pages are kept in the cache. Caching is now mandatory. urllib (not urllib2) is no longer needed. Solved a possible crash with log function (when passing list of str with non-unicode encoging).
README is also updated.
2013-04-04 17:43:30 +02:00
pictuga 05b5bc7783 Catch extra errors (timeout). 2013-03-29 20:06:31 +01:00
pictuga 6f6c5fbaad Faster xml cleaning 2013-03-01 14:26:51 +01:00
pictuga e305f387ab Hopefully fixed encoding issues
with the dirtiest trick out there...
2013-02-27 15:12:32 +01:00
pictuga ed8a45875c Default to "//h1/.." since most website use it
because it is said to be good for SEO. Debug now requires env variable "DEBUG" to be set to something else than "".
2013-02-25 21:36:02 +01:00
pictuga d39604c453 Support for cookies added
NYT needs them
2013-02-25 20:53:59 +01:00
pictuga d6179a734f Clearer debug info 2013-02-25 20:53:22 +01:00
pictuga eb63ce3f4f Handle more errors 2013-02-25 18:32:23 +01:00
pictuga b63f91a151 Added cache, easier debug 2013-02-25 18:01:59 +01:00
pictuga 51fe6ce81b First commit 2013-02-25 15:50:32 +01:00