Commit Graph

279 Commits

Author SHA1 Message Date
Andrew Dolgov 2d9c568439 add some timing information to update_daemon_common 2014-08-19 10:32:01 +04:00
Andrew Dolgov d5e490952a fix language detection; remove f_guid debugging 2014-08-19 10:18:32 +04:00
Andrew Dolgov eb16bd9f01 pass correct instance of pluginhost to calculate_article_hash 2014-08-18 23:11:29 +04:00
Andrew Dolgov af244f927e tweak article hash function 2014-08-18 21:55:48 +04:00
Andrew Dolgov b1840673cd simplify article filter chaining mechanism, do not try to process already stored articles 2014-08-18 18:32:16 +04:00
Andrew Dolgov 0084f0d1ba update stored article after filter has finished processing to prevent plugins from reverting changes of earlier invoked plugins (experimental) 2014-08-12 19:36:29 +04:00
Felix Eckhofer 523bd90baf Store size of enclosure to database 2014-07-15 16:23:46 +02:00
Dave Zaikos a3d5018490 Updated cache_images() to use _MIN_CACHE_IMAGE_SIZE constant when checking file size. New constant is defined upon including rssfuncs.php if not already defined. 2014-06-26 17:22:59 -04:00
wltb 01465325b4 Add subscribe hook, give more information to fetch_feed hook 2014-05-19 14:06:52 +02:00
Andrew Dolgov 15d0528a9f rssfuncs: mention local cache filename for easier debugging 2014-04-30 08:49:46 +00:00
Roland Angerer f77c72038f deactivated FeedParser re-use as it is not multiuser safe
different users may have different fetch feed and feed fetched hooks
2014-03-14 08:36:44 +01:00
Roland Angerer 95beaa1471 fixed feed caching logic for multiuser scenario
only vanilla feed data may be cached as every user may have different
fetched feed hooks to apply
2014-03-14 08:35:39 +01:00
Andrew Dolgov e95bb179a8 fix for article language breaking sql insert 2014-03-06 09:02:14 +04:00
Andrew Dolgov 5ba1ddd409 rssfuncs: remove unused variables 2014-02-19 15:17:29 +04:00
Andrew Dolgov 537eca4000 rssfuncs: fix typo 2013-12-18 14:22:24 +04:00
Andrew Dolgov f08426e3d3 when checking feeds to update, respect only default update setting stored in default profile 2013-12-18 13:55:50 +04:00
Andrew Dolgov 5a450b8760 add workaround for languagedetect idiotic shit of some kind 2013-11-13 20:36:15 +04:00
Andrew Dolgov d8179cb9d9 pubsubhubbub: use atom rel=self link (when available) when subscribing
to push-enabled feeds
2013-11-11 22:52:15 +04:00
Andrew Dolgov 4ad04ee227 report all libxml errors in updater debug output
force utf8 encoding if devforceupdate is on
parser: try to convert non-unicode feeds with specified encoding to utf8
before trying to remove dangling utf8 characters in case of utf8-related
libxml errors because doing so produces garbage content
2013-10-29 12:15:26 +04:00
Andrew Dolgov 88edaa9344 daemon: cache parser object while processing a batch of feeds withsame url on first success 2013-10-25 14:42:43 +04:00
Andrew Dolgov 5ddd2705ca make language detection optional (closes #779) 2013-09-27 13:45:21 +04:00
Andrew Dolgov 1357a263be include title when detecting article language 2013-09-17 12:28:31 +04:00
Andrew Dolgov 4f71d7431c replace suppress debugging kludge with a more flexible function (fixes
logging with update.php --feeds being stopped after first feed)
2013-09-15 23:02:21 +04:00
Andrew Dolgov a33558a61e pass logfile to child tasks if locking is possible, lock logfile before writing, add kludge to prevent update_rss_feed unneeded debugging go into master logfile 2013-09-02 12:33:59 +04:00
Andrew Dolgov f73e03e000 pass feed information to article filters 2013-09-02 10:03:04 +04:00
Andrew Dolgov 5c54e68388 support media:description for media: enclosures 2013-08-05 12:26:09 +04:00
Andrew Dolgov edba269b6f fix entries not inserted properly when no languages are detected 2013-08-02 16:03:13 +04:00
Andrew Dolgov 00f22824d7 rss: force language to 2 characters; run house keeping hooks properly 2013-08-02 14:47:34 +04:00
Andrew Dolgov 8e47022036 add hook_house_keeping 2013-08-02 14:06:18 +04:00
Andrew Dolgov 2fc4d981d1 remove unused old-style image rewritign 2013-08-02 14:04:00 +04:00
Andrew Dolgov 6b4617970f add text_languagedetect to guess article language for better hyphenation
(bump schema)
2013-07-31 10:30:17 +04:00
Andrew Dolgov 0997c2bd62 Revert "add temporary hack to store original unhashed guid into cached_content for debugging"
This reverts commit 8096e309a5.
2013-07-14 21:48:14 +04:00
Andrew Dolgov 8096e309a5 add temporary hack to store original unhashed guid into cached_content for debugging 2013-07-11 21:40:26 +04:00
Andrew Dolgov c052e25a8b remove unused cached_content 2013-07-10 16:55:55 +04:00
Andrew Dolgov 420940fa90 do not catchup newly subscribed feeds 2013-06-25 10:01:41 +04:00
Andrew Dolgov 6791af0cfd pass feed it to feed_fetched and fetch_feed hooks 2013-05-20 15:28:56 +04:00
Andrew Dolgov ee65bef405 add HOOK_FETCH_FEED 2013-05-20 15:26:53 +04:00
Andrew Dolgov 0ad2013bd2 update_rss_feed: remove unused override_url parameter 2013-05-20 15:20:14 +04:00
Andrew Dolgov 47673e6611 add fetch_url and owner_uid to HOOK_FEED_FETCHED 2013-05-18 09:22:06 +04:00
Andrew Dolgov d1f3fa9791 try to force-convert feed data to utf8 2013-05-17 20:09:43 +04:00
Andrew Dolgov fd687300bf Revert "subscribe_to_feed: use already fetched data when updating initially"
This reverts commit 23923fb29b.
2013-05-08 19:22:33 +04:00
Andrew Dolgov 23923fb29b subscribe_to_feed: use already fetched data when updating initially 2013-05-07 15:34:20 +04:00
Rasmus Lerdorf 6f7798b643 Fixing bugs found by static analysis 2013-05-07 00:35:10 -07:00
Andrew Dolgov f4ae0f053b update: remove debugging block 2013-05-02 10:26:32 +04:00
Andrew Dolgov 566417c4e7 restore updstart threshold 2013-05-02 02:31:32 +04:00
Andrew Dolgov 5d3e5a1bb2 simplify feed cache age handling (reduce caching to sequential updates) 2013-05-02 02:30:53 +04:00
Andrew Dolgov 5de4010487 disable marking for the time being 2013-05-02 02:21:11 +04:00
Andrew Dolgov 5ef8409700 move the precautionary timestamp bumping 2013-05-02 02:20:34 +04:00
Andrew Dolgov 5d0d3887af add _DISABLE_HTTP_304 2013-05-02 02:11:11 +04:00
Andrew Dolgov 15c762beda updater: show owner_uid for checked feeds 2013-05-02 02:02:49 +04:00
Andrew Dolgov 52637d3b30 remove cache valid bailout clause 2013-05-02 01:36:17 +04:00
Andrew Dolgov 865a3ed6a0 change feed cache file extension 2013-05-02 01:33:02 +04:00
Andrew Dolgov d4992d6b48 add support for dc:subject and slash:comments 2013-05-01 20:55:08 +04:00
Andrew Dolgov ee78f81ccd update: better tag-related debugging info 2013-05-01 20:33:59 +04:00
Andrew Dolgov 852d4ac890 support RDF-XML feeds 2013-05-01 20:30:52 +04:00
Andrew Dolgov fd0daa9b55 remove simplepie 2013-05-01 19:14:48 +04:00
Andrew Dolgov 431e27851b actually save feed xml in the cache 2013-05-01 18:10:27 +04:00
Andrew Dolgov 99429e57e4 remove simplepie entity decode hacks 2013-05-01 18:07:05 +04:00
Andrew Dolgov b8f316dc28 change caching to save xml data, remove RDF init section 2013-05-01 17:56:21 +04:00
Andrew Dolgov 04d2f9c831 add basic rss support 2013-05-01 17:38:16 +04:00
Andrew Dolgov cd07592c29 add basic tinyparser/atom 2013-05-01 17:04:57 +04:00
Andrew Dolgov 65c8d5e76d update: set last_updated to now() when process starts 2013-05-01 15:43:56 +04:00
Andrew Dolgov 39ede9862f experimental: decode numerical utf entities on import in entry title 2013-04-29 16:59:36 +04:00
Andrew Dolgov efe46a3b53 Merge pull request #167 from KonishchevDmitry/pr-allow-slash-in-filter-regex
Allow slashes in filter regular expressions
2013-04-27 02:10:27 -07:00
Dmitry Konishchev 7b80b5e160 Match each tag separately against user filter regular expression
Each article's tag should be matched against user filter regular
expression separately. Current matching confuses when you want to match
an exact tag. You suppose to write "^tag$", bug now have to write
"(^|,)tag(,|$)" which is very inconvenient and requires knowledge about
how do you process this matching.
2013-04-26 16:30:25 +04:00
Dmitry Konishchev ffa1bd7b19 Allow slashes in filter regular expressions
User's regular expressions need escaping before passing them to
preg_match()
2013-04-26 15:46:48 +04:00
Andrew Dolgov 90e5f4f1de base if-modified-since on last received article, not feed update timestamp 2013-04-25 18:42:48 +04:00
Andrew Dolgov 23283f11a3 fetch: better checking for 1970- date 2013-04-25 16:12:49 +04:00
Andrew Dolgov aafd55bab6 do not needlessly mark icons as failed 2013-04-24 19:01:51 +04:00
Andrew Dolgov 84ceb6bd2e fix floicon failed message not respecting debugging setting 2013-04-24 16:48:47 +04:00
Andrew Dolgov e2cf81e214 unify houskeeping stuff, increase spawn interval 2013-04-24 14:54:59 +04:00
Andrew Dolgov 654650857f reduce daemon chattiness a bit 2013-04-24 14:26:05 +04:00
Andrew Dolgov 8cabc200d5 increase updstart locking interval to prevent concurrent updates; decrease batch size for the daemon 2013-04-24 14:23:38 +04:00
Andrew Dolgov fce451a4f7 fix updating for pgsql
increase default batch size
remove random check between feedbrowser or update task executing
2013-04-24 13:43:40 +04:00
Andrew Dolgov 81c2066345 updating: do not use cache timestamp if it is unusable for current feed;
sort chronologically instead of random
2013-04-24 13:21:40 +04:00
Andrew Dolgov 6ee0d4b05c do not recalculate avg color repeatedly 2013-04-23 22:37:06 +04:00
Andrew Dolgov 560cbd8c0d attempt to recalculate favicon avg color if it changes 2013-04-23 22:33:23 +04:00
Andrew Dolgov 36490f11a3 add a hack so that if floicon fails on some icon, it won't be processed again 2013-04-23 22:30:06 +04:00
Daniel Andersson 1134497129 Fix doubly applied basepaths to lockfiles. 2013-04-19 21:25:26 +02:00
Andrew Dolgov 68cccafcad remove debug_enabled global 2013-04-19 13:42:40 +04:00
Andrew Dolgov 9ec103525d update_rss_feed: simplify debug reporting 2013-04-19 13:26:10 +04:00
Andrew Dolgov ebec81a6fb subscribe: verify XML before adding to the database; fetch: try to work around entity problems if initial parsing fails 2013-04-19 13:17:28 +04:00
Andrew Dolgov 1367bc3f5e trim received feed data on fetch 2013-04-19 12:35:30 +04:00
Andrew Dolgov 96db9a14d9 do not try to calculate icon avg color if GD is not present 2013-04-19 08:42:35 +04:00
Andrew Dolgov ce2b219c64 entry_author: do not double escape string, author & comments: trim 2013-04-18 16:25:02 +04:00
Andrew Dolgov 1ffe3391f9 make pluginhost a singleton 2013-04-18 12:27:34 +04:00
Andrew Dolgov 52d88392da move db-prefs to OO 2013-04-18 12:00:01 +04:00
Andrew Dolgov 3f6f085769 add better gzdecode support for php native fetch, use curl under open_basedir 2013-04-18 11:36:13 +04:00
Andrew Dolgov d0d86316ee Revert "do automatic gzcdecode on feed data"
This reverts commit aea03558bb.
2013-04-18 11:19:25 +04:00
Andrew Dolgov aea03558bb do automatic gzcdecode on feed data 2013-04-18 09:15:55 +04:00
Andrew Dolgov 48cb253625 get_article_filter: fix missing argument 2013-04-17 19:25:15 +04:00
Andrew Dolgov a42c55f02b fix blank character after opening bracket in function calls 2013-04-17 18:34:18 +04:00
Andrew Dolgov 7440a7fe3c daemon2: use passthrough to daemon-loop 2013-04-17 18:19:28 +04:00
Andrew Dolgov 6322ac79a0 remove $link 2013-04-17 16:48:41 +04:00
Andrew Dolgov 857efe49e6 add schema checking to feed updating and update script 2013-04-17 12:10:35 +04:00
Andrew Dolgov e2261e177b implement old log entry purging in update_daemon_common; add some more error types; tweak error log display 2013-04-16 20:34:20 +04:00
Andrew Dolgov 5e3d5480f7 use hashed guids in a hopefully backwards compatible fashion 2013-04-15 19:28:11 +04:00
Andrew Dolgov bc7a144dd5 properly remove and replace favicon color when favicon gets manually removed/uploaded 2013-04-15 18:22:48 +04:00
Andrew Dolgov 12a6bd28ac remove hsl stuff from classes/feeds, we take care of this when choosing the color 2013-04-15 17:07:06 +04:00
Andrew Dolgov 6ac722d5b1 try to improve color guessing algorithm a bit 2013-04-15 17:01:51 +04:00