Commit Graph

295 Commits

Author SHA1 Message Date
Andrew Dolgov 59e83455f1 add some more bayes stuff 2015-06-17 15:15:04 +03:00
Andrew Dolgov 35c12dc40a add workaround for mysql only accepting subset of unicode characters 2015-06-16 08:17:49 +03:00
Andrew Dolgov 6de3a1be7b allow plugins modify article score 2015-06-15 20:01:07 +03:00
Andrew Dolgov 168cf351e2 add housekeeping task for orphaned counter cache entries 2015-03-06 14:14:51 +03:00
Andrew Dolgov a29fe12195 add auto_assign_labels plugin; allow article filter plugins to add labels to articles 2015-03-04 00:26:52 +03:00
Andrew Dolgov 0c6f7b314a remove image.php; put cached image endpoint to public.php 2015-02-27 14:51:26 +03:00
Andrew Dolgov e6c826fa2f remove AUTO_ASSIGN_LABELS (confusing, idiots keep enable it and complain of the results) 2015-02-21 13:08:50 +03:00
Andrew Dolgov c9299c28c6 comment out old-school test trgm code 2015-01-19 12:51:59 +03:00
Andrew Dolgov e44280a0bd fix typo (refs #818) 2014-11-02 20:08:58 +03:00
Andrew Dolgov 3cc02a5b88 Merge branch 'master' of github.com:gothfox/Tiny-Tiny-RSS 2014-11-02 19:37:27 +03:00
Andrew Dolgov 6383c9c53a when registering feed title, consider its maximum length (closes #818) 2014-11-02 19:36:58 +03:00
Grant Pannell da7e3e91bf Fix allow duplicate posts feature with optimizations from 1.14 2014-11-02 16:40:10 +10:30
Andrew Dolgov 522e8b3500 update debugging: add force_rehash 2014-08-21 13:20:41 +04:00
Jonathon Padfield 5d2e74a95e Minor refactor for php v5.3, which doens't support immediate dereferencing of array entries 2014-08-20 01:16:52 +00:00
Andrew Dolgov 02b0348a8a mention that language passed is read only 2014-08-19 14:05:06 +04:00
Andrew Dolgov 73f9378d26 update: include detected language to article structure passed to filters 2014-08-19 14:02:28 +04:00
Andrew Dolgov 2d9c568439 add some timing information to update_daemon_common 2014-08-19 10:32:01 +04:00
Andrew Dolgov d5e490952a fix language detection; remove f_guid debugging 2014-08-19 10:18:32 +04:00
Andrew Dolgov eb16bd9f01 pass correct instance of pluginhost to calculate_article_hash 2014-08-18 23:11:29 +04:00
Andrew Dolgov af244f927e tweak article hash function 2014-08-18 21:55:48 +04:00
Andrew Dolgov b1840673cd simplify article filter chaining mechanism, do not try to process already stored articles 2014-08-18 18:32:16 +04:00
Andrew Dolgov 0084f0d1ba update stored article after filter has finished processing to prevent plugins from reverting changes of earlier invoked plugins (experimental) 2014-08-12 19:36:29 +04:00
Felix Eckhofer 523bd90baf Store size of enclosure to database 2014-07-15 16:23:46 +02:00
Dave Zaikos a3d5018490 Updated cache_images() to use _MIN_CACHE_IMAGE_SIZE constant when checking file size. New constant is defined upon including rssfuncs.php if not already defined. 2014-06-26 17:22:59 -04:00
wltb 01465325b4 Add subscribe hook, give more information to fetch_feed hook 2014-05-19 14:06:52 +02:00
Andrew Dolgov 15d0528a9f rssfuncs: mention local cache filename for easier debugging 2014-04-30 08:49:46 +00:00
Roland Angerer f77c72038f deactivated FeedParser re-use as it is not multiuser safe
different users may have different fetch feed and feed fetched hooks
2014-03-14 08:36:44 +01:00
Roland Angerer 95beaa1471 fixed feed caching logic for multiuser scenario
only vanilla feed data may be cached as every user may have different
fetched feed hooks to apply
2014-03-14 08:35:39 +01:00
Andrew Dolgov e95bb179a8 fix for article language breaking sql insert 2014-03-06 09:02:14 +04:00
Andrew Dolgov 5ba1ddd409 rssfuncs: remove unused variables 2014-02-19 15:17:29 +04:00
Andrew Dolgov 537eca4000 rssfuncs: fix typo 2013-12-18 14:22:24 +04:00
Andrew Dolgov f08426e3d3 when checking feeds to update, respect only default update setting stored in default profile 2013-12-18 13:55:50 +04:00
Andrew Dolgov 5a450b8760 add workaround for languagedetect idiotic shit of some kind 2013-11-13 20:36:15 +04:00
Andrew Dolgov d8179cb9d9 pubsubhubbub: use atom rel=self link (when available) when subscribing
to push-enabled feeds
2013-11-11 22:52:15 +04:00
Andrew Dolgov 4ad04ee227 report all libxml errors in updater debug output
force utf8 encoding if devforceupdate is on
parser: try to convert non-unicode feeds with specified encoding to utf8
before trying to remove dangling utf8 characters in case of utf8-related
libxml errors because doing so produces garbage content
2013-10-29 12:15:26 +04:00
Andrew Dolgov 88edaa9344 daemon: cache parser object while processing a batch of feeds withsame url on first success 2013-10-25 14:42:43 +04:00
Andrew Dolgov 5ddd2705ca make language detection optional (closes #779) 2013-09-27 13:45:21 +04:00
Andrew Dolgov 1357a263be include title when detecting article language 2013-09-17 12:28:31 +04:00
Andrew Dolgov 4f71d7431c replace suppress debugging kludge with a more flexible function (fixes
logging with update.php --feeds being stopped after first feed)
2013-09-15 23:02:21 +04:00
Andrew Dolgov a33558a61e pass logfile to child tasks if locking is possible, lock logfile before writing, add kludge to prevent update_rss_feed unneeded debugging go into master logfile 2013-09-02 12:33:59 +04:00
Andrew Dolgov f73e03e000 pass feed information to article filters 2013-09-02 10:03:04 +04:00
Andrew Dolgov 5c54e68388 support media:description for media: enclosures 2013-08-05 12:26:09 +04:00
Andrew Dolgov edba269b6f fix entries not inserted properly when no languages are detected 2013-08-02 16:03:13 +04:00
Andrew Dolgov 00f22824d7 rss: force language to 2 characters; run house keeping hooks properly 2013-08-02 14:47:34 +04:00
Andrew Dolgov 8e47022036 add hook_house_keeping 2013-08-02 14:06:18 +04:00
Andrew Dolgov 2fc4d981d1 remove unused old-style image rewritign 2013-08-02 14:04:00 +04:00
Andrew Dolgov 6b4617970f add text_languagedetect to guess article language for better hyphenation
(bump schema)
2013-07-31 10:30:17 +04:00
Andrew Dolgov 0997c2bd62 Revert "add temporary hack to store original unhashed guid into cached_content for debugging"
This reverts commit 8096e309a5.
2013-07-14 21:48:14 +04:00
Andrew Dolgov 8096e309a5 add temporary hack to store original unhashed guid into cached_content for debugging 2013-07-11 21:40:26 +04:00
Andrew Dolgov c052e25a8b remove unused cached_content 2013-07-10 16:55:55 +04:00
Andrew Dolgov 420940fa90 do not catchup newly subscribed feeds 2013-06-25 10:01:41 +04:00
Andrew Dolgov 6791af0cfd pass feed it to feed_fetched and fetch_feed hooks 2013-05-20 15:28:56 +04:00
Andrew Dolgov ee65bef405 add HOOK_FETCH_FEED 2013-05-20 15:26:53 +04:00
Andrew Dolgov 0ad2013bd2 update_rss_feed: remove unused override_url parameter 2013-05-20 15:20:14 +04:00
Andrew Dolgov 47673e6611 add fetch_url and owner_uid to HOOK_FEED_FETCHED 2013-05-18 09:22:06 +04:00
Andrew Dolgov d1f3fa9791 try to force-convert feed data to utf8 2013-05-17 20:09:43 +04:00
Andrew Dolgov fd687300bf Revert "subscribe_to_feed: use already fetched data when updating initially"
This reverts commit 23923fb29b.
2013-05-08 19:22:33 +04:00
Andrew Dolgov 23923fb29b subscribe_to_feed: use already fetched data when updating initially 2013-05-07 15:34:20 +04:00
Rasmus Lerdorf 6f7798b643 Fixing bugs found by static analysis 2013-05-07 00:35:10 -07:00
Andrew Dolgov f4ae0f053b update: remove debugging block 2013-05-02 10:26:32 +04:00
Andrew Dolgov 566417c4e7 restore updstart threshold 2013-05-02 02:31:32 +04:00
Andrew Dolgov 5d3e5a1bb2 simplify feed cache age handling (reduce caching to sequential updates) 2013-05-02 02:30:53 +04:00
Andrew Dolgov 5de4010487 disable marking for the time being 2013-05-02 02:21:11 +04:00
Andrew Dolgov 5ef8409700 move the precautionary timestamp bumping 2013-05-02 02:20:34 +04:00
Andrew Dolgov 5d0d3887af add _DISABLE_HTTP_304 2013-05-02 02:11:11 +04:00
Andrew Dolgov 15c762beda updater: show owner_uid for checked feeds 2013-05-02 02:02:49 +04:00
Andrew Dolgov 52637d3b30 remove cache valid bailout clause 2013-05-02 01:36:17 +04:00
Andrew Dolgov 865a3ed6a0 change feed cache file extension 2013-05-02 01:33:02 +04:00
Andrew Dolgov d4992d6b48 add support for dc:subject and slash:comments 2013-05-01 20:55:08 +04:00
Andrew Dolgov ee78f81ccd update: better tag-related debugging info 2013-05-01 20:33:59 +04:00
Andrew Dolgov 852d4ac890 support RDF-XML feeds 2013-05-01 20:30:52 +04:00
Andrew Dolgov fd0daa9b55 remove simplepie 2013-05-01 19:14:48 +04:00
Andrew Dolgov 431e27851b actually save feed xml in the cache 2013-05-01 18:10:27 +04:00
Andrew Dolgov 99429e57e4 remove simplepie entity decode hacks 2013-05-01 18:07:05 +04:00
Andrew Dolgov b8f316dc28 change caching to save xml data, remove RDF init section 2013-05-01 17:56:21 +04:00
Andrew Dolgov 04d2f9c831 add basic rss support 2013-05-01 17:38:16 +04:00
Andrew Dolgov cd07592c29 add basic tinyparser/atom 2013-05-01 17:04:57 +04:00
Andrew Dolgov 65c8d5e76d update: set last_updated to now() when process starts 2013-05-01 15:43:56 +04:00
Andrew Dolgov 39ede9862f experimental: decode numerical utf entities on import in entry title 2013-04-29 16:59:36 +04:00
Andrew Dolgov efe46a3b53 Merge pull request #167 from KonishchevDmitry/pr-allow-slash-in-filter-regex
Allow slashes in filter regular expressions
2013-04-27 02:10:27 -07:00
Dmitry Konishchev 7b80b5e160 Match each tag separately against user filter regular expression
Each article's tag should be matched against user filter regular
expression separately. Current matching confuses when you want to match
an exact tag. You suppose to write "^tag$", bug now have to write
"(^|,)tag(,|$)" which is very inconvenient and requires knowledge about
how do you process this matching.
2013-04-26 16:30:25 +04:00
Dmitry Konishchev ffa1bd7b19 Allow slashes in filter regular expressions
User's regular expressions need escaping before passing them to
preg_match()
2013-04-26 15:46:48 +04:00
Andrew Dolgov 90e5f4f1de base if-modified-since on last received article, not feed update timestamp 2013-04-25 18:42:48 +04:00
Andrew Dolgov 23283f11a3 fetch: better checking for 1970- date 2013-04-25 16:12:49 +04:00
Andrew Dolgov aafd55bab6 do not needlessly mark icons as failed 2013-04-24 19:01:51 +04:00
Andrew Dolgov 84ceb6bd2e fix floicon failed message not respecting debugging setting 2013-04-24 16:48:47 +04:00
Andrew Dolgov e2cf81e214 unify houskeeping stuff, increase spawn interval 2013-04-24 14:54:59 +04:00
Andrew Dolgov 654650857f reduce daemon chattiness a bit 2013-04-24 14:26:05 +04:00
Andrew Dolgov 8cabc200d5 increase updstart locking interval to prevent concurrent updates; decrease batch size for the daemon 2013-04-24 14:23:38 +04:00
Andrew Dolgov fce451a4f7 fix updating for pgsql
increase default batch size
remove random check between feedbrowser or update task executing
2013-04-24 13:43:40 +04:00
Andrew Dolgov 81c2066345 updating: do not use cache timestamp if it is unusable for current feed;
sort chronologically instead of random
2013-04-24 13:21:40 +04:00
Andrew Dolgov 6ee0d4b05c do not recalculate avg color repeatedly 2013-04-23 22:37:06 +04:00
Andrew Dolgov 560cbd8c0d attempt to recalculate favicon avg color if it changes 2013-04-23 22:33:23 +04:00
Andrew Dolgov 36490f11a3 add a hack so that if floicon fails on some icon, it won't be processed again 2013-04-23 22:30:06 +04:00
Daniel Andersson 1134497129 Fix doubly applied basepaths to lockfiles. 2013-04-19 21:25:26 +02:00
Andrew Dolgov 68cccafcad remove debug_enabled global 2013-04-19 13:42:40 +04:00
Andrew Dolgov 9ec103525d update_rss_feed: simplify debug reporting 2013-04-19 13:26:10 +04:00
Andrew Dolgov ebec81a6fb subscribe: verify XML before adding to the database; fetch: try to work around entity problems if initial parsing fails 2013-04-19 13:17:28 +04:00
Andrew Dolgov 1367bc3f5e trim received feed data on fetch 2013-04-19 12:35:30 +04:00
Andrew Dolgov 96db9a14d9 do not try to calculate icon avg color if GD is not present 2013-04-19 08:42:35 +04:00