xarantolus
c0a1a8926d
Use better regex for all fixed extraction types
2020-09-22 20:52:52 +02:00
xarantolus
bea9b00588
Fix regex for other variable declaration type
...
This now supports declarations like `window["ytInitialData"] = ...` and `var ytInitialData = ...`
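
A minimal sketch of what matching both declaration styles could look like. The pattern and the `extract_initial_data` helper are assumptions for illustration, not the regex from this commit; the JSON object is parsed with the standard library's `raw_decode` instead of being captured by the regex itself.

```python
import json
import re

# Hypothetical pattern: matches the assignment prefix of either
#   window["ytInitialData"] = {...}
#   var ytInitialData = {...}
# and stops right before the opening brace of the JSON object.
_INITIAL_DATA_PREFIX = re.compile(
    r'(?:window\s*\[\s*["\']ytInitialData["\']\s*\]|\bvar\s+ytInitialData)'
    r'\s*=\s*(?=\{)'
)


def extract_initial_data(webpage):
    """Return the parsed ytInitialData object, or None if no declaration is found."""
    match = _INITIAL_DATA_PREFIX.search(webpage)
    if match is None:
        return None
    # raw_decode parses one JSON value starting at the given index and
    # ignores whatever follows it (the trailing `;`, closing tags, ...).
    data, _end = json.JSONDecoder().raw_decode(webpage, match.end())
    return data
```

Letting the JSON decoder find the end of the object avoids guessing where the closing brace is with a regex.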
2020-09-06 09:22:49 +02:00
xarantolus
f442082a50
[youtube] More general title extraction
...
Seems like this attribute is moved every few weeks, so we just extract both and use the one that is present.
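
The "extract both and use the one that is present" idea can be sketched roughly as below. The `first_present` helper and the key paths in the usage example are hypothetical; the actual attribute names are not given in this log.

```python
def first_present(data, paths):
    """Return the value at the first key path that exists in `data`, else None.

    Each path is a sequence of dict keys, tried in the order given.
    """
    for path in paths:
        node = data
        for key in path:
            if isinstance(node, dict) and key in node:
                node = node[key]
            else:
                node = None
                break
        if node is not None:
            return node
    return None


# Hypothetical locations of the title; either one may be present.
TITLE_PATHS = (
    ('title', 'simpleText'),
    ('titleText',),
)

title = first_present({'titleText': 'Example video'}, TITLE_PATHS)
```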
2020-09-06 09:22:48 +02:00
xarantolus
1f93faf60b
[youtube] Make ytcfg.set config extraction non-fatal
...
If the markup of the page changes in the future, _FEED_DATA might still work while the other regex does not. Since the config is not necessary for the first page of videos, we make sure the program doesn't exit before extracting them.
TL;DR: Extract the first video page even if there are problems
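
A rough sketch of the non-fatal behaviour described above, under stated assumptions: `extract_ytcfg` and `extract_feed` are hypothetical helpers, the `"videoId"` regex stands in for the first-page extraction, and the real extractor code is not reproduced here.

```python
import json
import re
import warnings


def extract_ytcfg(webpage, fatal=True):
    """Return the object passed to ytcfg.set(...).

    With fatal=False a failure produces a warning and None instead of an error.
    """
    try:
        match = re.search(r'ytcfg\.set\s*\(\s*(?=\{)', webpage)
        if match is None:
            raise ValueError('ytcfg.set call not found')
        # Invalid JSON also raises ValueError (JSONDecodeError subclasses it).
        config, _end = json.JSONDecoder().raw_decode(webpage, match.end())
        return config
    except ValueError as err:
        if fatal:
            raise
        warnings.warn('Unable to extract ytcfg: %s' % err)
        return None


def extract_feed(webpage):
    """Return (video ids of the first page, config for further pages or None)."""
    first_page = re.findall(r'"videoId"\s*:\s*"([^"]+)"', webpage)
    # The config is only needed for later pages, so a failure must not abort.
    config = extract_ytcfg(webpage, fatal=False)
    return first_page, config
```

With this shape, a markup change that breaks the config regex costs only the later pages; the first page of videos is still returned.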
2020-09-06 09:22:47 +02:00
xarantolus
299056ad52
Fix python2 compatibility and title extraction
2020-09-06 09:22:47 +02:00
xarantolus
f536080701
Run formatter
2020-09-06 09:22:46 +02:00
xarantolus
5c430b67bd
[youtube] Fix feed extraction
...
In order to extract videos from further pages, we need to get various variables that are in an argument to the `ytcfg.set` call in a script on the feed page.
2020-09-06 09:22:46 +02:00
xarantolus
e03b4f3e05
[youtube] Make search extraction less dependent on json schema.
...
If an object looks like a video (it has a `videoId` key), assume that it is.
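
The "has a `videoId` key" heuristic can be sketched as a recursive walk over the parsed JSON. This is an illustration only: `find_videos` is a hypothetical helper, and not descending into a matched object (to avoid reporting nested references to the same video twice) is an assumption rather than something the commit states.

```python
def find_videos(node):
    """Yield every dict that has a `videoId` key, wherever it sits in the tree."""
    if isinstance(node, dict):
        if 'videoId' in node:
            yield node
            # Assumption: nested dicts inside a video refer to the same video.
            return
        children = node.values()
    elif isinstance(node, list):
        children = node
    else:
        return
    for child in children:
        for video in find_videos(child):
            yield video
```

Because the walk never relies on the names of the enclosing containers, renamed or re-nested wrappers in the response do not break it.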
2020-09-06 09:22:30 +02:00
xarantolus
19f671f88b
[youtube] Move search URL extraction to appropriate extractor
2020-09-06 09:21:26 +02:00
xarantolus
b948643f9c
[youtube] Fix extraction of search urls ( closes ytdl-org/youtube-dl#25696 )
2020-09-06 09:21:25 +02:00
Sergey M․
50e9fcc1fd
[nrktv:episode] Improve video id extraction ( closes #25594 , closes #26369 , closes #26409 )
2020-09-06 12:43:50 +07:00
random-nick
16ee69c1b7
[youtube] Fix age gate content detection ( #26100 ) ( closes #26152 , closes #26311 , closes #26384 )
2020-09-06 11:44:53 +07:00
Sergey M․
67171ed7e9
[youtube:user] Extend _VALID_URL ( closes #26443 )
2020-09-06 11:31:28 +07:00
TheRealDude2
62ae19ff76
[xhamster] Improve initials regex ( #26526 ) ( closes #26353 )
2020-09-06 11:10:27 +07:00
Sergey M․
5ed05f26ad
[svtplay] Fix svt id extraction ( closes #26425 , closes #26428 , closes #26438 )
2020-09-06 10:45:57 +07:00
Sergey M․
841b683804
[twitch] Rework extractors ( closes #12297 , closes #20414 , closes #20604 , closes #21811 , closes #21812 , closes #22979 , closes #24263 , closes #25010 , closes #25553 , closes #25606 )
...
* Switch to GraphQL.
+ Add support for collections.
+ Add support for clips and collections playlists.
2020-09-06 10:45:34 +07:00
Remita Amine
f5863a3ea0
[biqle] improve video_ext extraction
2020-08-27 19:20:41 +01:00
Sergey M․
10709fc7c6
[xhamster] Extend _VALID_URL ( closes #25927 )
2020-08-12 21:51:50 +07:00
TheRealDude2
a7e348556a
[xhamster] Fix extraction ( closes #26157 ) ( #26254 )
2020-08-12 21:42:17 +07:00
JChris246
6cb30ea5ed
[xhamster] Extend _VALID_URL ( closes #25789 ) ( #25804 )
2020-08-12 21:37:22 +07:00
Sergey M․
e450f6cb63
[youtube] Fix sigfunc name extraction ( closes #26134 , closes #26135 , closes #26136 , closes #26137 )
2020-07-28 05:05:38 +07:00
MRWITEK
a115e07594
[youtube] Improve description extraction ( closes #25937 ) ( #25980 )
2020-07-14 12:01:15 +01:00
Sergey M․
718393c632
[wistia] Restrict embed regex ( closes #25969 )
2020-07-11 18:27:19 +07:00
Glenn Slayden
07af16b92e
[youtube] Prevent excess HTTP 301 ( #25786 )
2020-07-01 02:56:16 +07:00
Sergey M․
e942cfd1a7
[youtube:playlists] Extend _VALID_URL ( closes #25810 )
2020-06-28 10:30:03 +07:00
Remita Amine
9a7e5cb88a
[bellmedia] add support for cp24.com clip URLs ( closes #25764 )
2020-06-23 15:09:13 +01:00
Sergey M․
2391941f28
[brightcove] Improve embed detection ( closes #25674 )
2020-06-16 17:38:25 +07:00
Sergey M․
ee0b726cd7
[youtube] Force old layout ( closes #25682 , closes #25683 , closes #25680 , closes #25686 )
2020-06-16 06:17:53 +07:00
Sergey M․
dbeafce5d5
[youtube] Fix categories and improve tags extraction
2020-06-16 03:13:39 +07:00
Sergey M․
a6211d237b
[youtube] Fix uploader id and uploader URL extraction
2020-06-16 02:43:09 +07:00
Sergey M․
7b16239a49
[youtube] Improve view count extraction
2020-06-16 02:38:45 +07:00
Sergey M․
37357d21a9
[youtube] Fix upload date extraction
2020-06-16 02:37:19 +07:00
Sergey M․
b477fc1314
[youtube] Fix thumbnails extraction and remove uploader id extraction warning ( closes #25676 )
2020-06-16 02:29:04 +07:00
Sergey M․
d84b21b427
[youtube] Fix playlist and feed extraction ( closes #25675 )
2020-06-16 02:01:12 +07:00
Philipp Hagemeister
48bd042ce7
[facebook] Support single-video ID links
...
I stumbled upon this at https://www.facebook.com/bwfbadminton/posts/10157127020046316 . No idea how prevalent it is yet.
2020-06-14 13:17:51 +02:00
Sergey M․
84213ea8d4
[youtube] Extract chapters from JSON ( closes #24819 )
2020-06-06 04:22:10 +07:00
Sergey M․
562de77f41
[kaltura] Add support for multiple embeds on a webpage ( closes #25523 )
2020-06-06 02:14:35 +07:00
Sergey M․
a5b6102ea8
[tele5] Bypass geo restriction
2020-06-06 01:45:05 +07:00
Sergey M․
b77888228d
[jwplatform] Add support for bypass geo restriction
2020-06-06 01:44:36 +07:00
Sergey M․
0b1eaec3bc
[tele5] Prefer jwplatform over nexx ( closes #25533 )
2020-06-06 01:35:09 +07:00
Sergey M․
b37e47a3f9
[twitch:stream] Expect 400 and 410 HTTP errors from API
2020-06-06 00:57:40 +07:00
Sergey M․
ce3735df02
[twitch:stream] Fix extraction ( closes #25528 )
2020-06-06 00:55:29 +07:00
Sergey M․
a0455d0ffd
[twitch] Pass v5 accept header and fix thumbnails extraction ( closes #25531 )
2020-06-06 00:12:47 +07:00
Sergey M․
c8b232cc48
[brightcove] Sort imports
2020-06-05 23:35:57 +07:00
Sergey M․
b4eb0bc7bd
[brightcove] Fix subtitles extraction ( closes #25540 )
2020-06-05 23:33:14 +07:00
Matej Dujava
d5147b65ac
[malltv] Add support for sk.mall.tv ( #25445 )
2020-06-01 21:11:31 +07:00
Sergey M․
7b0b53ea69
[twitter:broadcast] Add untitled periscope broadcast test
2020-06-01 20:32:57 +07:00
Sergey M․
7016e24ebe
[periscope] Fix untitled broadcasts ( #25482 )
2020-06-01 20:31:51 +07:00
Sergey M․
bef4688c72
[jwplatform] Improve embeds extraction ( closes #25467 )
2020-05-31 11:10:31 +07:00
Sergey M․
fe515e5c75
[ard:beta] Extend _VALID_URL ( closes #25405 )
2020-05-29 02:01:51 +07:00