Subject: Doesn't find a valid URI following a "\w.http", e.g. "club...http://bit.ly/9QfKVL"
While working on Bot::Twatterhose I noticed that URI::Find fails on
~0.6% of URLs posted on Twitter (16/2583 in my tests from
/public_timeline). This is because it doesn't grok a $scheme://[..] URI
that directly follows text such as "foo...", e.g. "foo...http://x.org".
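A minimal script to reproduce this might look like the following. It is
only a sketch: count_uris is a helper name made up for this report, and
the count for the second input depends on the installed URI::Find
version, so no expected output is shown for it.

```perl
use strict;
use warnings;
use URI::Find;

# Hypothetical helper (not part of URI::Find): returns how many URIs
# URI::Find reports in the given text.
sub count_uris {
    my ($text) = @_;
    my $count  = 0;
    my $finder = URI::Find->new(sub {
        my ($uri, $orig_text) = @_;
        $count++;
        return $orig_text;    # leave the matched text unchanged
    });
    $finder->find(\$text);    # find() takes a reference and edits in place
    return $count;
}

# Control case: the URL is separated from the preceding word by a space.
print "control: ", count_uris("see http://x.org now"), "\n";

# Failing case from this report: the URL directly follows "foo...".
print "dotted:  ", count_uris("foo...http://x.org"), "\n";
```

If the bug is present, the second line should report fewer URIs than the
first.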
Here's a real world example of a Twitter user expressing concern over
what he perceives to be an effeminate dance act:
#letsbeclear this dance is super gay & I bet not ever see a n*gga
doin it in the club...http://bit.ly/9QfKVL
Another example:
The technology of magnetic energy has become so powerful an entire
house can...http://bit.ly/8yEdeb
Due to this serious bug I've been missing out on the latest dance
moves, and I've apparently been paying too much for my energy.