10626 Commits

Author SHA1 Message Date
remitamine 4b464a6a78 [washingtonpost] improve format extraction and add support for video pages extraction 2016-05-23 00:48:11 +01:00
Sergey M․ 5db9df622f
[life:embed] Use native hls 2016-05-23 04:22:09 +06:00
Sergey M․ 5181759c0d
[life] Update _VALID_URL 2016-05-23 04:00:08 +06:00
Sergey M․ e54373204a
[lifenews] Fix metadata extraction 2016-05-23 03:44:04 +06:00
remitamine 102810ef04 [voxmedia] fix volume embed extraction 2016-05-22 20:37:35 +01:00
Yen Chi Hsuan 78d3b3e213
[generic] Improve Livestream detection (closes #2234) 2016-05-23 01:40:11 +08:00
Yen Chi Hsuan 7a46542f97
[livestream] Video IDs should always be strings (#2234) 2016-05-23 01:40:11 +08:00
Yen Chi Hsuan eb7941e3e6
[compat] Fix for XML with <!DOCTYPE> in Python 2.7 and 3.2
Such XML documents cause DeprecationWarning if python is run
with `-W error`
2016-05-23 01:40:11 +08:00
remitamine db3b8b2103 [tf1] add support for more related web sites 2016-05-22 17:03:17 +01:00
remitamine c5f5155100 [wat] extract all formats 2016-05-22 17:03:17 +01:00
Yen Chi Hsuan 4a12077855
[generic] Eliminate duplicated video URLs (closes #6562) 2016-05-22 22:23:20 +08:00
Thor77 70346165fe [bandcamp] raise ExtractorError when track not streamable (#9465)
* [bandcamp] raise ExtractorError when track not streamable

* [bandcamp] update md5 for second test

* don't rely on json-data, but just check for 'file'

* don't rely on presence of 'file'
2016-05-22 14:15:39 +08:00
Sergey M․ e9297256d4
[update] Fix youtube-dl.exe updating from arbitrary directory (Closes #2718) 2016-05-22 10:06:45 +06:00
Sergey M․ c6b9cf05e1
[utils] Do not fail on unknown date formats in unified_strdate 2016-05-22 08:28:41 +06:00
Sergey M․ 4d8819d249
[extractor/generic] Add support for theplatform embeds (Closes #8636, closes #9476) 2016-05-22 06:52:39 +06:00
Sergey M․ 898f4b49cc
[theplatform] Add _extract_urls 2016-05-22 06:47:22 +06:00
Sergey M․ 0150a00f33
[cc] Add test for tosh.cc (Closes #9566) 2016-05-22 02:58:41 +06:00
TRox1972 c8831015f4
[ComedyCentral] Add support for tosh.cc.com and cc.com/video-clips 2016-05-22 02:55:10 +06:00
Sergey M․ 92d221ad48
[periscope] Update uploader_id (Closes #9565) 2016-05-22 02:39:15 +06:00
Sergey M․ 0db9a05f88
[periscope:user] Adapt to layout changes (Closes #9563) 2016-05-22 02:15:56 +06:00
Philipp Hagemeister e03b35b8f9 release 2016.05.21.2 2016-05-21 21:47:39 +02:00
Philipp Hagemeister 598869afb1 release 2016.05.21.1 2016-05-21 21:27:00 +02:00
Philipp Hagemeister c8cc3745fb release 2016.05.21 2016-05-21 21:18:59 +02:00
Jaime Marquínez Ferrándiz 4c718d3c50 [rtve] Recognize 'filmoteca' URLs 2016-05-21 17:37:35 +02:00
Yen Chi Hsuan 115c65793a
[jwplatform] Don't fail with RTMP URLs without mp4:, mp3: or flv: 2016-05-21 13:50:38 +08:00
Yen Chi Hsuan 661d46b28f
[cbslocal] Add new extractor (closes #9522) 2016-05-21 13:40:45 +08:00
Yen Chi Hsuan 5ce3d5bd1b
[sendtonews] Add new extractor
Used in CBSLocal. Part of #9522
2016-05-21 13:39:42 +08:00
Yen Chi Hsuan 612b5f403e
[jwplatform] Improved m3u8 and rtmp support
Changes made for SendtoNewsIE. Part of #9522
2016-05-21 13:38:01 +08:00
Yen Chi Hsuan 9f54e692d2
[anvato] Add new extractor
Used in CBSLocal (#9522)
2016-05-21 13:18:29 +08:00
Yen Chi Hsuan 7b2fcbfd4e
[common] Skip TYPE=CLOSED-CAPTIONS lines in m3u8 manifests
According to [1], valid values for TYPE are AUDIO, VIDEO, SUBTITLES
and CLOSED-CAPTIONS. Such a value is found in Anvato master playlists,
though I don't use _extract_m3u8_formats() in the end.

Part of #9522.

[1] https://tools.ietf.org/html/draft-pantos-http-live-streaming-19#section-4.3.4.1
2016-05-21 13:16:28 +08:00
Yen Chi Hsuan 16da9bbc29
[common] Add _m3u8_meta_format() template
For extractors who handle m3u8 manifests by themselves. (eg., AnvatoIE)

Part of #9522
2016-05-21 13:15:28 +08:00
Sergey M․ c8602b2f9b
[nrk] Unquote subtitles' URLs 2016-05-21 05:09:16 +06:00
Sergey M․ b219f5e51b
[brightcove:new] Improve error reporting 2016-05-21 00:59:06 +06:00
Sergey M․ 1846e9ade0
[localnews8] Fix extractor (Closes #9539) 2016-05-20 22:31:08 +06:00
TRox1972 6756602be6
[LocalNews8] add extractor (Closes #9200) 2016-05-20 22:10:13 +06:00
Sergey M․ 6c114b1210
[extractor/generic] Remove generic id and title from wistia extraction and update tests 2016-05-20 21:55:35 +06:00
Sergey M․ 7ded6545ed
[extractor/generic] Add test for wistia standard embed 2016-05-20 21:43:36 +06:00
Sergey M․ aa5957ac49
[extractor/generic] Add support for async wistia embeds (Closes #9549) 2016-05-20 21:33:31 +06:00
remitamine 64413f7563 [cbc] fix extraction for flv only videos(fixes #5309) 2016-05-20 16:21:23 +01:00
Sergey M․ 45f160a43c
[wistia] Improve hls support 2016-05-20 21:16:08 +06:00
Sergey M․ 36ca2c55db
[wistia] Skip storyboard and improve extraction 2016-05-20 21:04:01 +06:00
Sergey M․ f0c96af9cb
[wistia] Add alias and modernize 2016-05-20 20:55:10 +06:00
Yen Chi Hsuan 31a70191e7
[cbc] Add the test case from #5156 2016-05-20 19:04:50 +08:00
Yen Chi Hsuan ad96b4c8f5
[common] Extract audio formats in SMIL
Found in http://www.cbc.ca/player/play/2657631896

Closes #5156
2016-05-20 19:02:53 +08:00
Yen Chi Hsuan 043dc9d36f
[cbc] Fix for old-styled URLs
The URL http://www.cbc.ca/player/News/ID/2672225049/ (#6342) redirects
to http://www.cbc.ca/player/play/2672224672, while youtube-dl wasn't
able to handle it correctly.
2016-05-20 18:39:54 +08:00
remitamine 52f7c75cff [cbc] extract http formats and update tests 2016-05-20 06:58:46 +01:00
Sergey M․ f6e588afc0
[24video] Fix description extraction 2016-05-20 08:53:04 +06:00
remitamine a001296703 [learnr] Add new extractor(closes #4284) 2016-05-19 18:18:03 +01:00
Sergey M․ dd81769c62
[ndtv] Fix extraction 2016-05-19 04:34:19 +06:00
Sergey M․ 46bc9b7d7c
[utils] Allow None in remove_{start,end} 2016-05-19 04:31:30 +06:00
remitamine b78531a36a [formula1] Add new extractor(closes #3617) 2016-05-18 22:24:46 +01:00
Sergey M․ 11e6a0b641
[nfb] Modernize and extract subtitles 2016-05-18 00:25:15 +06:00
Sergey M․ 15cda1ef77
[nfb] Fix uploader extraction 2016-05-17 23:46:47 +06:00
Yen Chi Hsuan 055f0d3d06
[abcnews] Added a new extractor (closes #3992)
Related: #6108, #8664, #9459
2016-05-17 15:38:57 +08:00
Yen Chi Hsuan cdd94c2eae
[utils] Check for None values in SOCKS proxy
Originally reported at
https://github.com/rg3/youtube-dl/pull/9287#issuecomment-219617864
2016-05-17 14:38:15 +08:00
Philipp Hagemeister 36755d9d69 release 2016.05.16 2016-05-16 17:25:47 +02:00
Sergey M․ f7199423e5
[groupon] Add support for Youtube embeds (Closes #9508) 2016-05-16 00:30:13 +06:00
Sergey M․ a0a81918f1
[collegehumor] Remove extractor
It now uses brightcove
2016-05-15 22:07:51 +06:00
Yen Chi Hsuan 5572d598a5
[hearthisat] Update the first test 2016-05-15 15:44:04 +08:00
Yen Chi Hsuan cec9727c7f
[hearthisat] Detect invalid download links (fixes #9440) 2016-05-15 15:35:31 +08:00
Yen Chi Hsuan 79298173c5
[utils] Fix getheader in urlhandle_detect_ext
Fixes #7049, related to #9440
2016-05-15 15:34:50 +08:00
Sergey M․ 69c9cc2716
[xvideos] Extract html5 player formats (Closes #9495) 2016-05-15 03:38:04 +06:00
Sergey M․ ed56f26039
[extractor/common] Improve name extraction for m3u8 formats 2016-05-15 03:34:35 +06:00
Sergey M․ 6f41b2bcf1
[extractor/generic] Improve 3qsdn embeds support (Closes #9453) 2016-05-14 23:58:25 +06:00
Sergey M․ cda6d47aad
[utils] Simplify integer conversion in js_to_json 2016-05-14 23:41:57 +06:00
Sergey M․ 5d39176f6d
[extractor/generic:3qsdn] Add support for embeds 2016-05-14 23:40:34 +06:00
Sergey M․ 5c86bfe70f
[3qsdn] Add extractor 2016-05-14 23:35:03 +06:00
Sergey M․ ca950f49e9
[ora] Revert extraction to regexes
It's less fragile than using js_to_json with ora js
2016-05-14 20:45:18 +06:00
Sergey M․ 89ac4a19e6
[utils] Process non-base 10 integers in js_to_json 2016-05-14 20:39:58 +06:00
felix 640eea0a0c
[ora] minimise fragile regex shenanigans; recognise unsafespeech.com URLs 2016-05-14 20:13:06 +06:00
felix bd1e484448
[utils] js_to_json: various improvements
now JS object literals like { /* " */ 0: ",]\xaa<\/p>", } will be correctly converted to JSON.
2016-05-14 20:12:39 +06:00
Yen Chi Hsuan a834622b89 Merge pull request #9492 from jwilk/teamcoco
[teamcoco] Fix base64 regexp
2016-05-14 20:02:40 +08:00
Jakub Wilk 66e7ace17a Don't hardcode errno constant
The value of ENOENT is architecture-dependent, so don't assume it's
always 2.
2016-05-14 13:41:41 +02:00
Jakub Wilk 791ff52f75 [teamcoco] Fix base64 regexp 2016-05-14 13:19:54 +02:00
Sergey M․ b5abf86148
[cinemassacre] Remove extractor (Closes #9457)
It now uses jwplatform
2016-05-14 04:53:14 +06:00
Sergey M․ 134c6ea856
[YoutubeDL] Sanitize url for url and url_transparent extraction results 2016-05-14 04:46:38 +06:00
remitamine 0730be9022 [sina] fix extraction(fixes #1146) 2016-05-13 20:25:01 +01:00
Sergey M․ 96c2e3e909
[imdb] Improve extraction 2016-05-13 23:25:05 +06:00
Sergey M․ f196508f7b
[imdb] Relax _VALID_URL (Closes #9481) 2016-05-13 22:19:00 +06:00
Yen Chi Hsuan cc1028aa6d
[openload] Fix extraction (closes #9472) 2016-05-13 18:11:08 +08:00
remitamine ad55e10165 [brightcove] change the protocol for m3u8 formats to m3u8_native 2016-05-13 08:35:38 +01:00
remitamine 18cf6381f6 [nrk] extract m3u8 formats 2016-05-13 08:05:28 +01:00
remitamine cdf32ff15d [extractors] add import for UstudioEmbedIE 2016-05-13 05:25:32 +01:00
remitamine 99d79b8692 [ustudio] add support ustudio app/embed urls 2016-05-13 05:21:45 +01:00
remitamine b9e7bc55da [mgtv] extract http formats 2016-05-12 22:46:23 +01:00
Sergey M․ d8d540cf0d
[nrk] Rework extractor (Closes #9470) 2016-05-13 02:07:12 +06:00
Sergey M․ 0df79d552a
[twitch:bookmarks] Remove extractor
Bookmarks no longer available
2016-05-13 00:14:30 +06:00
Sergey M․ 0db3a66162
[twitch] Skip dead tests 2016-05-12 23:57:52 +06:00
Yen Chi Hsuan 7581bfc958
[utils] Unquote credentials passed to SOCKS proxies
Fixes #9450
2016-05-13 00:27:25 +08:00
Yen Chi Hsuan a3fa6024d6
[bloomberg] Fix test_Bloomberg
In this test case, sometimes HLS is the best format while sometimes HDS
is. To prevent occasional test failures, force HDS to be the best
format. In the past, testing against HDS formats causes the same error
as #9214, which is fixed as #9377 landed.
2016-05-12 20:08:42 +08:00
Yen Chi Hsuan 1b405bb47d
[downloader/f4m] Tolerate truncate segments when testing
Replaces #9216

Fixes #9214 and test_Bloomberg partially
2016-05-12 20:02:36 +08:00
Yen Chi Hsuan 7e8ddca1bb
[vevo] Delay the georestriction check to prevent false alerts
Fixes #9408
2016-05-12 19:56:58 +08:00
Yen Chi Hsuan 778a1ccca7
[utils] Add Œ and œ found in French to ACCENT_CHARS
Fixes #9463
2016-05-12 19:48:48 +08:00
Yen Chi Hsuan 4540515cb3
[iqiyi] Fix 1080P extraction (closes #9446) 2016-05-12 18:48:27 +08:00
Sergey M․ e0741fd449
[__init__] Simplify colon presence check 2016-05-11 22:03:30 +06:00
teemuy e73b9c65e2
Bugfix: Allow colons in custom HTTP header values. 2016-05-11 21:59:24 +06:00
Yen Chi Hsuan 702ccf2dc0
[compat] Rename shlex_quote and remove unused subprocess_check_output 2016-05-10 16:00:21 +08:00
Philipp Hagemeister 28b4f73620 release 2016.05.10 2016-05-10 09:08:08 +02:00
Yen Chi Hsuan 6ddb4888d2
[options] Update --proxy description for SOCKS proxies 2016-05-10 14:51:38 +08:00
Yen Chi Hsuan fa5cb8d021
[socks] Remove a superfluous clause 2016-05-10 14:51:38 +08:00