Commit Graph

190 Commits (d6b90448f3dbfd07d8b6fdfb5bbb20f28b3a331f)

Author SHA1 Message Date
pictuga 4a88886767 morss: get_page to act as a basic proxy (for iframes) 2020-04-04 16:37:15 +02:00
pictuga ae3bd58386 README: clarify newsreader hook syntax 2020-03-20 11:43:19 +01:00
pictuga e3be9b5a9e README: improve layout 2020-03-20 11:41:43 +01:00
pictuga f8c09af563 README: improve syntax highlighting 2020-03-20 11:33:52 +01:00
pictuga 48beeede97 README: update cache part 2020-03-20 11:27:05 +01:00
pictuga b3b90c067a morss.py: remove "useless" functions
Have to keep the code clean
2020-03-20 11:19:06 +01:00
pictuga a1c8093b6e README: improve md layout 2020-03-19 23:06:25 +01:00
pictuga e681dd56c2 README: reflect new code location 2020-03-19 23:04:21 +01:00
pictuga ab145813d6 README: remove ref to dropped deps 2020-03-18 16:48:07 +01:00
pictuga 9dbe061fd6 Remove markdown-related code
Time to clean up the code and stop with those non-core features
They just make the code harder to maintain
2020-03-18 16:47:00 +01:00
pictuga 4536902530 README: make new licence clear 2018-11-11 16:13:32 +01:00
pictuga 3c09dbdf31 README: list some non-obvious features 2018-09-30 21:56:30 +02:00
pictuga ba673392d7 README: test website no longer online 2018-09-30 21:56:03 +02:00
pictuga d091e74d56 crawler: add MySQL backend
With extra dependency
2017-11-04 14:51:41 +01:00
pictuga e0092387b1 Add "six" as a requirement in README
Actually a dependency of "chardet", but since people (including me) might just send file onto their server instead of installing the libs properly, it's a nice reminder (not in requirements.txt since that one is automatic)
2017-07-30 16:58:53 +02:00
pictuga d3bc2926fc Remove :hungry
Mostly usless. If you need it, you might as well not need to use morss in the first place...
2017-03-25 13:52:58 -10:00
pictuga 6024728341 README: mention / > | in webserver 2017-03-20 21:05:40 -10:00
pictuga c929ce6278 README: update about options.items 2017-03-20 21:02:25 -10:00
pictuga beec6469cc README: link to todo-list 2017-03-18 22:38:50 -10:00
pictuga 10fef6be4e README: "update" markdown format
Thanks GitHub for changing what you accept
2017-03-18 22:36:32 -10:00
pictuga 3385198f4f README: remove removed :force 2017-03-08 19:20:43 -10:00
pictuga 92b4a5c57c README: remove :smart doc as it was dropped 2017-03-08 17:38:00 -10:00
Florian Muenchbach 543ae89e22 Added new 'encoding' option to README. 2017-03-08 17:38:00 -10:00
pictuga 795a0b281f README: add wheezy.template as dep 2017-03-08 15:25:45 -10:00
pictuga af66ab5980 README: simplify the sample file tree for mod_cgi
So that it never gets outdated again...
2017-03-08 15:21:26 -10:00
pictuga fe3f6b7e33 README: Fetch->FeedFetch 2017-03-08 15:17:40 -10:00
pictuga 2b4269e1ac README: add chardet 2017-03-08 11:45:13 -10:00
pictuga 52661985d1 Remove one more readability fork reference 2017-02-28 19:25:59 -10:00
pictuga 47d33099b2 Remove readability fork from deps
Since we now use our own, faster, implementation
2017-02-28 19:21:51 -10:00
pictuga 466d8e47d6 Also make buriy's readability port compatible
Should be faster, and it now supports py3
2015-08-29 18:33:12 +02:00
pictuga 9b911213b6 Convenient README update
Break lines, update info, say something about uwsgi
2015-08-29 12:45:36 +02:00
pictuga 554bdb4650 Fix "typo" in README 2015-08-29 12:32:40 +02:00
pictuga 5a9d3ff0bd Add xpath idea to README 2015-05-04 22:23:08 +08:00
pictuga ec7184b8f6 README: python3 supported 2015-04-07 19:01:41 +08:00
pictuga 32aa96afa7 Cache HTTP content using a custom Handler
Much much cleaner. Nothing comparable
2015-04-06 23:26:12 +08:00
pictuga b3572e143d New way of calling the program
python -m morss, python morss/main.py
2015-03-11 14:23:14 +08:00
pictuga 6ae60d0343 2to3: py3-compatible readability fork 2015-03-03 01:03:03 +08:00
pictuga 7a66115012 README: use py -m morss.morss
Needed for 2to3 relative imports
2015-02-25 11:56:34 +08:00
pictuga d5269964fc Make :theforce also bypass http errors 2015-02-21 01:58:16 +08:00
pictuga ef946c0712 XML pretty-print in separate option
Who reads plain XML anyway?
2015-02-20 17:38:39 +08:00
pictuga f46576168a Add :mono to disable multithreading
Convenient to have linear logging
2014-11-10 23:14:54 +01:00
pictuga c5e3aa664a Update README after After split 2014-11-08 20:32:34 +01:00
pictuga dedb7fa5a4 Add :smart to README 2014-11-07 18:42:57 +01:00
pictuga aed2a9bdd3 Add new options to README 2014-11-07 10:20:33 +01:00
pictuga 5f8288eecb Add :hungry to fill feeds with long intros 2014-06-28 01:43:31 +02:00
pictuga 484432d804 Add :callback for JSONP calls 2014-06-28 00:59:57 +02:00
pictuga 226441d821 Add :cors for cross-domain XHR (with README update) 2014-06-28 00:59:13 +02:00
pictuga 230659a34b Reenable args with values 2014-06-28 00:58:37 +02:00
pictuga 3c859423b3 Add requirements, edit setup.py and README 2014-06-21 16:38:48 +02:00
pictuga 3fbeb892ca Fix typo in README 2014-06-21 16:30:18 +02:00
pictuga 7211093cc5 Add :smart :noref modes, update README 2014-06-16 14:00:02 +02:00
pictuga 814ff46fbd README: explanation of what morss.process is 2014-05-24 19:17:58 +02:00
pictuga 1728e3902c README: cache no longer mandatory 2014-05-24 19:14:12 +02:00
pictuga 3aea7cb8da Further document lib usage 2014-05-24 19:03:05 +02:00
pictuga cf1f5d79fa More README info on lib use 2014-05-22 23:13:25 +02:00
pictuga 11923e0a6d Fix python lib code in README
Missing 'morss.' added
2014-05-22 22:47:18 +02:00
pictuga b03d865b7b Get rid of ParseOptions()
That thing wasn't nice, and depended too much on the various use case. The new approach is to turn morss into a library and turn the use cases into some pre-implemented lib usages
2014-05-22 22:44:59 +02:00
pictuga 16713e6d2a Fix README sample python code 2014-05-21 15:05:35 +02:00
pictuga 49f0be1919 Change README to enjoy Github syntax highlighting 2014-05-21 12:46:48 +02:00
pictuga 433374b227 Quick dirty python library use example
I'm fully aware of the ugliness of the code provided
2014-05-21 12:44:05 +02:00
pictuga a27f0cd0b6 Fix README misspell 2014-05-21 12:27:29 +02:00
pictuga 78cd27a68e README disclaimer about poor code quality 2014-05-21 12:14:26 +02:00
pictuga c0a6e5c70f Update README for the server
Tell about /www provided files, and roughly how to use it
2014-05-21 12:09:50 +02:00
pictuga de7c3f887f Make last use case howto similar to the others 2014-05-21 12:07:15 +02:00
pictuga 8c7177602d Updated dependencies, unbalanced parenthesis 2014-01-13 00:23:20 +01:00
pictuga 6eac20535f readme: Add csv, md 2014-01-13 00:16:58 +01:00
pictuga 4038033336 Update README
New settings (eg. al, txt, silent), included HTTP server, new todo item (readability rewrite)
2014-01-09 20:34:12 +01:00
pictuga bb6a3b3d4d Update README todo
JSON output and own HTTP server have been implemented
2014-01-09 19:57:03 +01:00
pictuga 03884be1bd Update README
LIM_TIME, arguments, CLI use case, facebook api, feedify, arguments explanation
2013-11-16 17:48:21 +01:00
pictuga 4d2d17f9e1 Tell about dateutil in readme 2013-10-12 23:43:09 +02:00
pictuga 2eb6e69b5a full-text with a dash in README 2013-09-15 17:59:17 +02:00
pictuga a94d659bc8 Make negation in README more obvious 2013-08-25 00:01:00 +02:00
pictuga 6e891ef6ff Nicer link display in readme 2013-07-11 14:17:04 +02:00
pictuga 981e83fd1e Add link to online test version 2013-07-11 14:11:23 +02:00
pictuga 89662ccbae typo in readme 2013-06-19 22:16:46 +03:00
pictuga 16f2e3b4c3 todo and newsreader hook update in readme
Updated liferea use to reflect code changes. Link to morss.it as live "preview". Added a todo. Added dependencies list.
2013-06-19 21:12:03 +02:00
pictuga 9ad9ffaf91 Use proper markdown for links in readme 2013-06-11 13:10:40 +02:00
pictuga f09dfbacf5 Warning in README: no http server provided 2013-05-23 21:54:11 +02:00
pictuga a8feac9811 Detail MAX settings in README 2013-05-23 21:48:45 +02:00
pictuga ee08cccf9c Updated README since SERVER var drop. 2013-04-28 11:37:11 +02:00
pictuga 4abf7b699c Use readability to fetch article content.
Makes the whole "xpath rules" things useless. Almost any feed is now supported. CSS liferea stylesheets are also uneeded now, since readability cleans up html code a more efficient way. README was updated.
2013-04-19 11:37:43 +02:00
pictuga 437b0da8a9 Updated README to reflect 404 redirection support. 2013-04-19 11:30:34 +02:00
pictuga af8879049f Another huge commit.
Now uses OOP where it fits. Atom feeds are supported, but no real tests were made. Unix globbing is now possible for urls. Caching is done a cleaner way. Feedburner links are also replaced. HTML is cleaned a more efficient way. Code is now much cleaner, using lxml.objectify and a small wrapper to access Atom feeds as if they were RSS feeds (and much faster than feedparser). README has been updated.
2013-04-15 18:51:55 +02:00
pictuga ad25516e34 Speak about deleteTags in README. 2013-04-04 18:31:26 +02:00
pictuga 82084c2c75 Move to OOP.
This is a huge commit. The whole code is ported to Object-Oritented Programming. This makes the code cleaner, which became required to deal with all the different cases, for example with encoding detection. Encoding detection now works better, and uses 3 different methods. HTML pages with an xml declaration are now supported. Feed urls with parameters (eg. "index.php?option=par") are also supported. Cache is now smarter, since it no longer grows indefinitely, since only in-use pages are kept in the cache. Caching is now mandatory. urllib (not urllib2) is no longer needed. Solved a possible crash with log function (when passing list of str with non-unicode encoging).
README is also updated.
2013-04-04 17:43:30 +02:00
pictuga f734fb2623 Added quick licence information. 2013-03-29 20:05:53 +01:00
pictuga 682ab253b0 Typo in README 2013-02-25 21:56:16 +01:00
pictuga 217ff0fd8f Use better markdown syntax for default xpath rule 2013-02-25 21:55:17 +01:00
pictuga 27b0fbaf01 Speak about default xpath in README 2013-02-25 21:54:04 +01:00
pictuga be17f0c78f Updated README to markdown 2013-02-25 21:49:38 +01:00