Commit graph

2830 commits

Author SHA1 Message Date
Marcel 65f91148fc
[parsing] search for case-insensitive tag names 2023-03-18 18:38:48 +01:00
Marcel 6169b3eca8
[parsing] replace HTMLCommentRanges with HTMLIgnoreRanges
* ignore matches within CDATA elements and comments
2023-03-18 18:38:47 +01:00
Marcel 29278a3323
[parsing] fix return value 2023-03-18 18:38:46 +01:00
Marcel 7a67a2028f
[parsing] tweak tag regex 2023-03-18 18:38:46 +01:00
Marcel dbf350c122
[parsing] return unclosed matched tags 2023-03-18 18:38:45 +01:00
Marcel 8451074b50
[parsing] fix: don't push unmatched void tags onto queue 2023-03-18 18:38:45 +01:00
Marcel 176a156c65
[parsing] rework interface, implemented all get_element(s) functions + extract_attributes() as MatchingElementParser class methods and improve performance 2023-03-18 18:38:44 +01:00
Marcel 5e3894df3f
[parsing] add new module containing various HTML parser classes as replacement for utils.get_html_... functions
* performance is mostly better for large HTML data and on PyPy
2023-03-18 18:38:43 +01:00
Marcel af03fa4542
[utils] more forgiving html parsing + unit tests 2023-03-18 18:38:43 +01:00
Lesmiscore c14af7a741
[extractor/iwara] Overhaul extractors (#6557)
Authored by: Lesmiscore
2023-03-18 23:29:02 +09:00
viktor-enzell 9a06b7b189
[extractor/drtv] Fix radio page extraction (#6552)
Authored by: viktor-enzell
2023-03-18 13:06:46 +00:00
bashonly 216bcb66d7
[extractor/tiktok] Improve TikTokLive extractor (#6520)
Closes #6459
Authored by: bashonly
2023-03-16 19:54:56 +00:00
bashonly 460da07439
[extractor/genius] Add support for articles (#6474)
Closes #6465
Authored by: bashonly
2023-03-16 19:54:25 +00:00
bashonly 03025b6e10
[extractor/mediastream] Improve WinSports and embed extraction (#6426)
Closes #6419, Closes #6527
Authored by: bashonly
2023-03-16 19:53:18 +00:00
Nicholas Defranco 071670cbea
[extractor/youtube] Fix parsing comment_count (#6523)
Closes #5849
Authored by: nick-cd
2023-03-15 04:51:14 +05:30
coletdjnz 607510b9f2
[extractor/youtube] Handle incomplete initial data from watch page (#6510)
Authored by: coletdjnz
2023-03-13 01:43:37 +00:00
pukkandan 98ac902c49
[dependencies/Cryptodome] Fix __bool__
Bug in 65f6e80780
2023-03-13 05:21:43 +05:30
unbeatable-101 cbfe2e5cbe
[extractor/nebula] Add beta.nebula.tv (#6516)
Authored by: unbeatable-101
2023-03-13 04:55:05 +05:30
Chris Caruso cf9fd52fab
[extractor/jwplatform] Update _extract_embed_urls (#6383)
Authored by: carusocr
2023-03-12 23:37:34 +05:30
JChris246 80ea6d3dea
[extractor/Parler] Rewrite extractor (#6446)
Authored by: JChris246
Closes #6068
2023-03-12 23:32:17 +05:30
Joshua Lochner 1e3c2b6ec2
[extractor/medaltv] Fix clips (#6502)
Closes #6489
Authored by: xenova
2023-03-12 23:08:27 +05:30
Ha Tien Loi 026435714c
[extractor/LastFM] Rewrite playlist extraction (#6379)
Authored by: hatienl0i261299, pukkandan
Closes #5975
2023-03-12 22:50:40 +05:30
Ha Tien Loi 0181b9a1b3
[extractor/thesun] Update _VALID_URL (#6522)
Authored by: hatienl0i261299
Closes #6479
2023-03-12 22:04:22 +05:30
pukkandan e389d172b6
Fix 2a23d92d9e
Closes #6517
2023-03-12 14:47:05 +05:30
pukkandan 2a23d92d9e
[extractor/youtube] Construct fragment list lazily
Building fragment list for all formats take significant time for large videos
2023-03-11 22:46:47 +05:30
pukkandan 86cb922118
[extractor/youtube] Add extractor-arg include_duplicate_formats 2023-03-11 22:34:13 +05:30
Lesmiscore c795c39f27
[extractor/youtube] Add client name to format_note when -v (#6254)
Authored by: Lesmiscore, pukkandan
2023-03-11 22:33:23 +05:30
vampirefrog 7a6c8a0807
[extractor/rokfin] Re-construct manifest url (#6507)
Authored by: vampirefrog
2023-03-11 22:22:36 +05:30
Daniel Vogt 89dbf08483
[extractor/opencast] Fix format bug (#6512)
Authored by: C0D3D3V
2023-03-11 20:40:32 +05:30
pukkandan e6ab678e36
[extractor/hidive] Fix login
Fixes https://github.com/yt-dlp/yt-dlp/issues/6493#issuecomment-1462906556
2023-03-10 17:27:43 +05:30
pukkandan ab1de9cb1e
Support loading info.json with a list at it's root 2023-03-10 14:15:13 +05:30
makeworld 871c907454
[extractor/cbc:gem] Update _VALID_URL (#6499)
Authored by: makeworld-the-better-one
Closes #6395
2023-03-10 13:23:19 +05:30
Elyse 0551511b45
[extractor/twitch] Fix is_live (#6500)
Closes #6494
Authored by: elyse0
2023-03-10 12:42:38 +05:30
pukkandan c9abebb851
[extractor/youtube] Bypass throttling for -f17
and related cleanup

Thanks @AudricV for the finding
2023-03-09 22:13:03 +05:30
pukkandan 66aeaac9aa
[downloader/curl] Fix progress reporting
Bug in 8c53322cda
Closes #6490
2023-03-09 21:58:07 +05:30
Daniel Vogt 3588be59ce
[extractor/opencast] Add ltitools to _VALID_URL (#6371)
Authored by: C0D3D3V
2023-03-09 21:51:39 +05:30
D0LLYNH0 2d5cae9636
[extractor/iq] Set more language codes (#6476)
Authored by: D0LLYNH0
2023-03-09 12:48:14 +05:30
Simon Sawicki 9b7a48abd1
[cookies] Defer extraction of v11 key from keyring
Closes #6082

Authored by: Grub4K
2023-03-08 21:49:24 +01:00
bashonly 01ddec7e66
[postprocessor] Fix chapters if duration is not extracted (#6037)
Authored by: bashonly
2023-03-08 13:10:19 +00:00
bashonly 6f4fc5660f
[extractor/chilloutzone] Fix extractor (#6445)
Closes #6029
Authored by: bashonly
2023-03-08 12:37:34 +00:00
Simon Sawicki 3b479100df
[utils] write_string: Fix noconsole behavior
Ref: https://github.com/pyinstaller/pyinstaller/pull/7217

Authored by: Grub4K
2023-03-07 22:34:07 +01:00
permunkle d4e6ef4077
[extractor/nubilesporn] Add extractor (#6231)
Authored by: permunkle
2023-03-07 00:32:03 +05:30
bashonly c459d45dd4
[extractor/teamcoco] Fix extractor (#6437)
Closes #6339
Authored by: bashonly
2023-03-05 18:36:48 +00:00
github-actions 8729e7b57c Release 2023.03.04
Created by: pukkandan

:ci skip all :ci run dl
2023-03-04 22:24:51 +00:00
pukkandan 392389b7df
[cleanup] Misc 2023-03-05 03:34:55 +05:30
Elyse eb8fd6d044
[extractor/lefigaro] Add extractors (#6309)
Authored by: elyse0
Closes #6197
2023-03-05 03:30:45 +05:30
Ferdinand Bachmann f44cb4e77b
[extractor/tubetugraz] Support --twofactor (#6424) (#6427)
Authored by: Ferdi265
Closes #6424
2023-03-05 03:28:16 +05:30
Elyse 46580ced56
[extractor/tunein] Fix extractors (#6310)
Authored by: elyse0
Closes #2973
2023-03-05 01:35:19 +05:30
Elyse b404712822
[extractor/telecaribe] Add extractor (#6311)
Authored by: elyse0
Closes #6001
2023-03-05 01:11:41 +05:30
Chris Caruso 1f8489cccb
[extractor/lumni] Add extractor (#6302)
Authored by: carusocr
Closes #6202
2023-03-05 00:52:11 +05:30