Commit graph

10640 commits

Author SHA1 Message Date
wankerer 4ee0b8afdb [eporner] fix for the new URL layout
Recently eporner slightly changed the URL layout, the ID that used to be
digits only are now digits and letters, so youtube-dl falls back to
the generic extractor that doesn't work.

Fix the matching regex to allow letters in ID.

[v2: added a test case]
2016-05-24 15:57:36 -07:00
remitamine 1de32771e1 [eyedotv] Add new extractor(closes #9582) 2016-05-24 20:10:12 +01:00
remitamine 688c634b7d skip some tests to reduce test time 2016-05-24 16:44:11 +01:00
Sergey M․ 6b43132ce9
[xhamster] Update tests 2016-05-24 21:38:27 +06:00
mexican porn commits a4690b3244
[xhamster] url regex fix for videos with empty title. 2016-05-24 21:35:43 +06:00
remitamine 444417edb5 [radiocanada] Add new extractor(#4020) 2016-05-24 15:58:27 +01:00
remitamine 277c7465f5 [ooyala] check manifest ext with determine_ext and update tests for related extractors 2016-05-24 11:24:29 +01:00
Kagami Hiiragi 25bcd3550e [vlive] Address site update
Changes:
* Fix video params extraction
* Don't make status request since status info now available on the page
* Remove unneeded code
* Fix test
2016-05-24 12:54:28 +03:00
remitamine a4760d204f [ooyala] use api v2 to reduce requests for format extraction 2016-05-24 00:22:29 +01:00
remitamine e8593f346a [ooyala] extract subtitles 2016-05-23 23:58:16 +01:00
remitamine 05b651e3a5 [washingtonpost] reduce requests for m3u8 manifests 2016-05-23 13:04:50 +01:00
remitamine 42a7439717 [cbs] allow to pass content id to the extractor(closes #9589) 2016-05-23 09:31:37 +01:00
remitamine b1e9ebd080 [washingtonpost] remove unnecessary code 2016-05-23 02:30:12 +01:00
remitamine 0c50eeb987 [reuters] Add new extractor 2016-05-23 02:27:31 +01:00
remitamine 4b464a6a78 [washingtonpost] improve format extraction and add support for video pages extraction 2016-05-23 00:48:11 +01:00
Sergey M․ 5db9df622f
[life:embed] Use native hls 2016-05-23 04:22:09 +06:00
Sergey M․ 5181759c0d
[life] Update _VALID_URL 2016-05-23 04:00:08 +06:00
Sergey M․ e54373204a
[lifenews] Fix metadata extraction 2016-05-23 03:44:04 +06:00
remitamine 102810ef04 [voxmedia] fix volume embed extraction 2016-05-22 20:37:35 +01:00
Yen Chi Hsuan 78d3b3e213
[generic] Improve Livestream detection (closes #2234) 2016-05-23 01:40:11 +08:00
Yen Chi Hsuan 7a46542f97
[livestream] Video IDs should always be strings (#2234) 2016-05-23 01:40:11 +08:00
Yen Chi Hsuan eb7941e3e6
[compat] Fix for XML with <!DOCTYPE> in Python 2.7 and 3.2
Such XML documents cause DeprecationWarning if python is run
with `-W error`
2016-05-23 01:40:11 +08:00
remitamine db3b8b2103 [tf1] add support for more related web sites 2016-05-22 17:03:17 +01:00
remitamine c5f5155100 [wat] extract all formats 2016-05-22 17:03:17 +01:00
Yen Chi Hsuan 4a12077855
[genric] Eliminate duplicated video URLs (closes #6562) 2016-05-22 22:23:20 +08:00
Thor77 70346165fe [bandcamp] raise ExtractorError when track not streamable (#9465)
* [bandcamp] raise ExtractorError when track not streamable

* [bandcamp] update md5 for second test

* don't rely on json-data, but just check for 'file'

* don't rely on presence of 'file'
2016-05-22 14:15:39 +08:00
Sergey M․ e9297256d4
[update] Fix youtube-dl.exe updating from arbitrary directory (Closes #2718) 2016-05-22 10:06:45 +06:00
Sergey M․ c6b9cf05e1
[utils] Do not fail on unknown date formats in unified_strdate 2016-05-22 08:28:41 +06:00
Sergey M․ 4d8819d249
[extractor/generic] Add support for theplatform embeds (Closes #8636, closes #9476) 2016-05-22 06:52:39 +06:00
Sergey M․ 898f4b49cc
[theplatform] Add _extract_urls 2016-05-22 06:47:22 +06:00
Sergey M․ 0150a00f33
[cc] Add test for tosh.cc (Closes #9566) 2016-05-22 02:58:41 +06:00
TRox1972 c8831015f4
[ComedyCentral] Add support for tosh.cc.com and cc.com/video-clips 2016-05-22 02:55:10 +06:00
Sergey M․ 92d221ad48
[periscope] Update uploader_id (Closes #9565) 2016-05-22 02:39:15 +06:00
Sergey M․ 0db9a05f88
[periscope:user] Adapt to layout changes (Closes #9563) 2016-05-22 02:15:56 +06:00
Philipp Hagemeister e03b35b8f9 release 2016.05.21.2 2016-05-21 21:47:39 +02:00
Philipp Hagemeister 598869afb1 release 2016.05.21.1 2016-05-21 21:27:00 +02:00
Philipp Hagemeister c8cc3745fb release 2016.05.21 2016-05-21 21:18:59 +02:00
Jaime Marquínez Ferrándiz 4c718d3c50 [rtve] Recognize 'filmoteca' URLs 2016-05-21 17:37:35 +02:00
Yen Chi Hsuan 115c65793a
[jwplatform] Don't fail with RTMP URLs without mp4:, mp3: or flv: 2016-05-21 13:50:38 +08:00
Yen Chi Hsuan 661d46b28f
[cbslocal] Add new extractor (closes #9522) 2016-05-21 13:40:45 +08:00
Yen Chi Hsuan 5ce3d5bd1b
[sendtonews] Add new extractor
Used in CBSLocal. Part of #9522
2016-05-21 13:39:42 +08:00
Yen Chi Hsuan 612b5f403e
[jwplatform] Improved m3u8 and rtmp support
Changes made for SendtoNewsIE. Part of #9522
2016-05-21 13:38:01 +08:00
Yen Chi Hsuan 9f54e692d2
[anvato] Add new extractor
Used in CBSLocal (#9522)
2016-05-21 13:18:29 +08:00
Yen Chi Hsuan 7b2fcbfd4e
[common] Skip TYPE=CLOSED-CAPTIONS lines in m3u8 manifests
According to [1], valid values for TYPE are AUDIO, VIDEO, SUBTITLES
and CLOSED-CAPTIONS. Such a value is found in Anvato master playlists,
though I don't use _extract_m3u8_formats() in the end.

Part of #9522.

[1] https://tools.ietf.org/html/draft-pantos-http-live-streaming-19#section-4.3.4.1
2016-05-21 13:16:28 +08:00
Yen Chi Hsuan 16da9bbc29
[common] Add _m3u8_meta_format() template
For extractors who handle m3u8 manifests by themselves. (eg., AnvatoIE)

Part of #9522
2016-05-21 13:15:28 +08:00
Sergey M․ c8602b2f9b
[nrk] Unquote subtitles' URLs 2016-05-21 05:09:16 +06:00
Sergey M․ b219f5e51b
[brightcove:new] Improve error reporting 2016-05-21 00:59:06 +06:00
Sergey M․ 1846e9ade0
[localnews8] Fix extractor (Closes #9539) 2016-05-20 22:31:08 +06:00
TRox1972 6756602be6
[LocalNews8] add extractor (Closes #9200) 2016-05-20 22:10:13 +06:00
Sergey M․ 6c114b1210
[extractor/generic] Remove generic id and title from wistia extractionand update tests 2016-05-20 21:55:35 +06:00
Sergey M․ 7ded6545ed
[extractor/generic] Add test for wistia standard embed 2016-05-20 21:43:36 +06:00
Sergey M․ aa5957ac49
[extractor/generic] Add support for async wistia embeds (Closes #9549) 2016-05-20 21:33:31 +06:00
remitamine 64413f7563 [cbc] fix extraction for flv only videos(fixes #5309) 2016-05-20 16:21:23 +01:00
Sergey M․ 45f160a43c
[wistia] Improve hls support 2016-05-20 21:16:08 +06:00
Sergey M․ 36ca2c55db
[wistia] Skip storyboard and improve extraction 2016-05-20 21:04:01 +06:00
Sergey M․ f0c96af9cb
[wistia] Add alias and modernize 2016-05-20 20:55:10 +06:00
Yen Chi Hsuan 31a70191e7
[cbc] Add the test case from #5156 2016-05-20 19:04:50 +08:00
Yen Chi Hsuan ad96b4c8f5
[common] Extract audio formats in SMIL
Found in http://www.cbc.ca/player/play/2657631896

Closes #5156
2016-05-20 19:02:53 +08:00
Yen Chi Hsuan 043dc9d36f
[cbc] Fix for old-styled URLs
The URL http://www.cbc.ca/player/News/ID/2672225049/ (#6342) redirects
to http://www.cbc.ca/player/play/2672224672, while youtube-dl wasn't
able to handle it correctly.
2016-05-20 18:39:54 +08:00
remitamine 52f7c75cff [cbc] extract http formats and update tests 2016-05-20 06:58:46 +01:00
Sergey M․ f6e588afc0
[24video] Fix description extraction 2016-05-20 08:53:04 +06:00
remitamine a001296703 [learnr] Add new extractor(closes #4284) 2016-05-19 18:18:03 +01:00
Sergey M․ dd81769c62
[ndtv] Fix extraction 2016-05-19 04:34:19 +06:00
Sergey M․ 46bc9b7d7c
[utils] Allow None in remove_{start,end} 2016-05-19 04:31:30 +06:00
remitamine b78531a36a [formula1] Add new extractor(closes #3617) 2016-05-18 22:24:46 +01:00
Sergey M․ 11e6a0b641
[nfb] Modernize and extract subtitles 2016-05-18 00:25:15 +06:00
Sergey M․ 15cda1ef77
[nfb] Fix uploader extraction 2016-05-17 23:46:47 +06:00
Yen Chi Hsuan 055f0d3d06
[abcnews] Added a new extractor (closes #3992)
Related: #6108, #8664, #9459
2016-05-17 15:38:57 +08:00
Yen Chi Hsuan cdd94c2eae
[utils] Check for None values in SOCKS proxy
Originally reported at
https://github.com/rg3/youtube-dl/pull/9287#issuecomment-219617864
2016-05-17 14:38:15 +08:00
Philipp Hagemeister 36755d9d69 release 2016.05.16 2016-05-16 17:25:47 +02:00
Sergey M․ f7199423e5
[groupon] Add support for Youtube embeds (Closes #9508) 2016-05-16 00:30:13 +06:00
Sergey M․ a0a81918f1
[collegehumor] Remove extractor
It now uses brightcove
2016-05-15 22:07:51 +06:00
Yen Chi Hsuan 5572d598a5
[hearthisat] Update the first test 2016-05-15 15:44:04 +08:00
Yen Chi Hsuan cec9727c7f
[hearthisat] Detect invalid download links (fixes #9440) 2016-05-15 15:35:31 +08:00
Yen Chi Hsuan 79298173c5
[utils] Fix getheader in urlhandle_detect_ext
Fixes #7049, related to #9440
2016-05-15 15:34:50 +08:00
Sergey M․ 69c9cc2716
[xvideos] Extract html5 player formats (Closes #9495) 2016-05-15 03:38:04 +06:00
Sergey M․ ed56f26039
[extractor/common] Improve name extraction for m3u8 formats 2016-05-15 03:34:35 +06:00
Sergey M․ 6f41b2bcf1
[extractor/generic] Improve 3qsdn embeds support (Closes #9453) 2016-05-14 23:58:25 +06:00
Sergey M․ cda6d47aad
[utils] Simplify integer conversion in js_to_json 2016-05-14 23:41:57 +06:00
Sergey M․ 5d39176f6d
[extractor/generic:3qsdn] Add support for embeds 2016-05-14 23:40:34 +06:00
Sergey M․ 5c86bfe70f
[3qsdn] Add extractor 2016-05-14 23:35:03 +06:00
Sergey M․ ca950f49e9
[ora] Revert extraction to regexes
It's less fragile than using js_to_json with ora js
2016-05-14 20:45:18 +06:00
Sergey M․ 89ac4a19e6
[utils] Process non-base 10 integers in js_to_json 2016-05-14 20:39:58 +06:00
felix 640eea0a0c
[ora] minimise fragile regex shenanigans; recognise unsafespeech.com URLs 2016-05-14 20:13:06 +06:00
felix bd1e484448
[utils] js_to_json: various improvements
now JS object literals like { /* " */ 0: ",]\xaa<\/p>", } will be correctly converted to JSON.
2016-05-14 20:12:39 +06:00
Yen Chi Hsuan a834622b89 Merge pull request #9492 from jwilk/teamcoco
[teamcoco] Fix base64 regexp
2016-05-14 20:02:40 +08:00
Jakub Wilk 66e7ace17a Don't hardcode errno constant
The value of ENOENT is architecture-dependent, so don't assume it's
always 2.
2016-05-14 13:41:41 +02:00
Jakub Wilk 791ff52f75 [teamcoco] Fix base64 regexp 2016-05-14 13:19:54 +02:00
Sergey M․ b5abf86148
[cinemassacre] Remove extractor (Closes #9457)
It now uses jwplatform
2016-05-14 04:53:14 +06:00
Sergey M․ 134c6ea856
[YoutubeDL] Sanitize url for url and url_transparent extraction results 2016-05-14 04:46:38 +06:00
remitamine 0730be9022 [sina] fix extraction(fixes #1146) 2016-05-13 20:25:01 +01:00
Sergey M․ 96c2e3e909
[imdb] Improve extraction 2016-05-13 23:25:05 +06:00
Sergey M․ f196508f7b
[imdb] Relax _VALID_URL (Closes #9481) 2016-05-13 22:19:00 +06:00
Yen Chi Hsuan cc1028aa6d
[openload] Fix extraction (closes #9472) 2016-05-13 18:11:08 +08:00
remitamine ad55e10165 [brightcove] change the protocol for m3u8 formats to m3u8_native 2016-05-13 08:35:38 +01:00
remitamine 18cf6381f6 [nrk] extract m3u8 formats 2016-05-13 08:05:28 +01:00
remitamine cdf32ff15d [extractors] add import for UstudioEmbedIE 2016-05-13 05:25:32 +01:00
remitamine 99d79b8692 [ustudio] add support ustudio app/embed urls 2016-05-13 05:21:45 +01:00
remitamine b9e7bc55da [mgtv] extract http formats 2016-05-12 22:46:23 +01:00
Sergey M․ d8d540cf0d
[nrk] Rework extractor (Closes #9470) 2016-05-13 02:07:12 +06:00