pictuga
e98617c320
Time in twitter feedify
2013-10-09 00:50:10 +02:00
pictuga
12d48918d8
Add time handling in feeds.py
...
Lots of time format can be passed as a value (RSS/Atom time strings, timestamp, anything supported by dateutil.parser). Implemented in twitter feedify hook.
2013-10-09 00:48:05 +02:00
pictuga
16b5e4b018
Fix .htaccess for no-gzip when in 'progress' mode
2013-10-09 00:45:11 +02:00
pictuga
76b81a31ec
Drop desc by default
...
Arguably the reason is that users tend to believe the program does nothing, cause Firefox (and others?) only shows the description in the feed preview. The "keep" parameter keeps the description in place.
2013-10-02 12:06:14 +02:00
pictuga
ae22104831
Center "* * *" in clip mode
2013-10-02 12:05:52 +02:00
pictuga
196400967d
Type of self.xdel in feeds
2013-10-02 11:33:40 +02:00
pictuga
60fb90eb01
Add google support in feedify
2013-10-01 20:19:17 +02:00
pictuga
685bc7988e
Improve feedify string grabbing
2013-10-01 20:18:55 +02:00
pictuga
050be7690d
Ability to delete desc, content, title in feeds
2013-10-01 19:52:21 +02:00
pictuga
145f59b546
Fix already-filled feeds
...
By pulling the content from "content" when needed
2013-10-01 19:50:28 +02:00
pictuga
ae5e947417
Add support for facebook, google links
2013-10-01 19:49:53 +02:00
pictuga
cf5040020e
Use desktop browser UA for xml
...
Convenient for rss links detection and "feedify"
2013-10-01 19:47:45 +02:00
pictuga
e3c1cd8619
Accept more types as "text" before readability
2013-10-01 19:47:06 +02:00
pictuga
78706952fe
Remove "clip" from Fill
...
Put that in Gather. Also removed from feeds.py. "alone" mode was also added (it removes the description).
2013-10-01 19:45:54 +02:00
pictuga
1b7fe8fbee
Use "options" in Gather instead of "progress"
...
Also made it possible to toggle Fill's toggle through parameters
2013-09-29 15:32:58 +02:00
pictuga
a5a327388a
Add ability not to fetch an item's article
2013-09-25 13:47:05 +02:00
pictuga
9f22402dca
Twitter: use pahe <title> as feed title
2013-09-25 12:48:50 +02:00
pictuga
0657077191
Add support for twitter
...
Grabs "feed" from the html page, clips tweet and article together.
2013-09-25 12:37:14 +02:00
pictuga
da14242bcf
Add feedify, and use it in morss
2013-09-25 12:36:21 +02:00
pictuga
9bc4417be3
More flexible xml caching
...
New includes a 'type' var, to remember what we did out of it (normal, nothing, grabbed xml link, etc). xml/html mimetype are now saved in a dict, for easier editing, and consistency.
2013-09-25 12:32:40 +02:00
pictuga
edff54a016
Add pushContent in feeds.py
...
Useful for twitter (later) for it's "clip" toggle, which keeps the original desc/content above the article. Makes changing the content, while keeping the original stub in place, easier.
2013-09-25 12:18:22 +02:00
pictuga
208d70d3db
Use separate var in Fill for final url
...
That way the url can be changed altogether for the article-fetching part, without changing the item link itself. Useful for upcoming twitter feeds.
2013-09-25 11:51:48 +02:00
pictuga
fd1501a0c0
Check relative url earlier
2013-09-25 11:49:45 +02:00
pictuga
1e621099e0
Log cache hash in Gather
2013-09-25 11:15:11 +02:00
pictuga
3d6d7e70b6
Remove useless "as" in error catch
2013-09-25 11:14:22 +02:00
pictuga
e73cbf56c2
Add 'html' option, usefull to see error on server
2013-09-25 11:13:33 +02:00
pictuga
03014a8cbf
Typo in UA_HTML var name
2013-09-25 11:11:11 +02:00
pictuga
4a5cbcfd18
Move httplib in common code
...
Needed for error catch
2013-09-25 11:10:16 +02:00
pictuga
3fd34ff1a6
decodeHTML works without connection object
2013-09-25 11:08:58 +02:00
pictuga
e759cd46c6
Add set/getLinks in FeedItem (base)
2013-09-17 00:45:15 +02:00
pictuga
355bfa5efd
Adding items, creating feeds now possible
...
Both hakish to do, but works
2013-09-17 00:44:10 +02:00
pictuga
658f51e5a9
Support feeds handed out as text/html
...
<http://www.pro-linux.de/rss/index1.xml > and <http://tehrantimes.com/index.php?option=com_ninjarsssyndicator&feed_id=1&format=raw > are on an equal footing…
2013-09-16 00:33:24 +02:00
pictuga
8eb2f7c249
Added another letter to feedsportal table
2013-09-15 19:38:59 +02:00
pictuga
23246ca6c1
Save the key in cache file
2013-09-15 19:20:51 +02:00
pictuga
1b7777c331
Find RSS links within html pages' <head>
...
And cache those links
2013-09-15 19:19:50 +02:00
pictuga
1bd17f1365
Faster relative link resolution
2013-09-15 19:18:39 +02:00
pictuga
7575291f8f
Log url in Gather
...
Useful for upcoming commits
2013-09-15 18:53:35 +02:00
pictuga
532852a408
Use cleaner http error catch
...
One error type was inheriting from another one
2013-09-15 18:52:34 +02:00
pictuga
2eb6e69b5a
full-text with a dash in README
2013-09-15 17:59:17 +02:00
pictuga
43bf021f23
Catch more http exceptions
...
Such as InvalidURL. Subclasses of httplib.HTTPException
2013-09-15 17:19:33 +02:00
pictuga
9252e75923
Ensure var in parseOptions are defined
...
Caused a bug on morss.it's server
2013-09-15 15:56:08 +02:00
pictuga
75b51fc2c2
Add ability to bypass ETag support
...
Add the ":force" argument over http to bypass ETag support, which is convenient to debug code
2013-09-15 15:54:42 +02:00
pictuga
d2de6cf23d
Extra doc for DELAY, for xml cache, & now for etag
2013-09-15 15:45:15 +02:00
pictuga
89187ab6a6
Log generation time
2013-09-15 15:44:25 +02:00
pictuga
04840d9843
More flexible parameters can be passed
...
Multiple parameters can now be passed. HTTP "API" has been improved, and url now have to be like "http://<path to morss>/:<param1>:<param2>/<url>". The code handling the parameters parsing is now way cleaner. Debug toggle is now a var, which can be changed with parameters. Also http logging is no longer done into a file, which tended to grow way too fast, while lacking an "error 403 protection", but instead the parameter ":debug" can be passed in the url, and the page will be delivered as "text/plain" with the debug written into it. Therefore some logging had to be moved around, so as not to output anything during http headers definition.
2013-09-15 15:38:03 +02:00
pictuga
c25aec7107
Only perform <meta> redirects on html pages
2013-09-15 15:33:14 +02:00
pictuga
3176c2a8e8
Fix bad characters detection
...
Now works with any encoding, no longer restricted to utf-8. Uses regex to find encoding (not perfect, but rather fast, since it's used on a substring)
2013-09-15 14:57:37 +02:00
pictuga
3ba74649f6
Test if linked pages are text documents
...
Useful for feeds such as HackerNews
2013-09-10 15:25:55 +02:00
pictuga
1b7fdad6a8
Improve broken XML support
...
TPB feed is a good example <http://rss.thepiratebay.sx/blog >. Now supports ampersand in feed, using the "recover" mode in etree.parse. Broken utf-8 strings in feed are now also supported.
2013-09-08 15:48:34 +02:00
pictuga
5ebd84ee55
Fix broken feeds.py calls for items count
2013-09-08 15:47:15 +02:00