Commit graph

18009 commits

Author SHA1 Message Date
Jody Bruchon ecdec1913f Keep download archive in memory for better performance
The old behavior was to open and scan the entire archive file for
every single video download. This resulted in horrible performance
for archives of any remotely large size, especially since all new
video IDs are appended to the end of the archive. For anyone who
uses the archive feature to maintain archives of entire video
playlists or channels, this meant that all such lists with newer
downloads would have to scan close to the end of the archive file
before the potential download was rejected. For archives with tens
of thousands of lines, this easily resulted in millions of line
reads and checks over the course of scanning a single channel or
playlist that had been seen previously.

The new behavior in this commit is to preload the archive file
into a binary search tree and scan the tree instead of constantly
scanning the file on disk for every file. When a new download is
appended to the archive file, it is also added to this tree. The
performance is massively better using this strategy over the more
"naive" line-by-line archive file parsing strategy.

The only negative consequence of this change is that the archive
in memory will not be synchronized with the archive file on disk.
Running multiple instances of the program at the same time that
all use the same archive file may result in duplicate archive
entries or duplicated downloads. This is unlikely to be a serious
issue for the vast majority of users. If the instances are not
likely to try to download identical video IDs then this should
not be a problem anyway; for example, having two instances pull
two completely different YouTube channels at once should be fine.

Signed-off-by: Jody Bruchon <jody@jodybruchon.com>
2020-09-17 14:22:07 -04:00
Unknown 7ac0ba50ce [hotstar] regex fix 2020-09-17 14:00:03 +02:00
Unknown fe84e2a391 [skip travis] winver 2020-09-16 14:22:51 +02:00
Unknown 17cb02d0c6 bump version 2020.09.16 2020-09-16 13:55:35 +02:00
Unknown 78895bd3a1 [Core] hls manifests, dynamic mpd 2020-09-16 13:00:41 +02:00
Tom-Oliver Heidel 08676fb591 Merge branch 'Zocker1999NET-ext/remuxe-video' 2020-09-16 10:31:30 +02:00
Tom-Oliver Heidel cd93279de8 Merge branch 'ext/remuxe-video' of https://github.com/Zocker1999NET/youtube-dl into Zocker1999NET-ext/remuxe-video 2020-09-16 10:30:38 +02:00
Tom-Oliver Heidel 89233ccbfb
Merge pull request #110 from JensTimmerman/patch-5
Update README.md
2020-09-14 15:12:26 +02:00
Jens Timmerman 8a92dee72c
Update README.md
cleanup + typo fix
2020-09-14 13:59:51 +02:00
Tom-Oliver Heidel 04e2a14b65 Merge branch 'tpikonen-elonet' 2020-09-14 13:26:09 +02:00
Tom-Oliver Heidel c11c64f318 Merge branch 'elonet' of https://github.com/tpikonen/youtube-dl into tpikonen-elonet 2020-09-14 13:24:18 +02:00
Tom-Oliver Heidel 4c7d0c13e1 Merge branch 'fix-mitele' of https://github.com/DjMoren/youtube-dl 2020-09-14 13:22:50 +02:00
Tom-Oliver Heidel b4b3a22dae Merge branch 'DjMoren-fix-mitele' 2020-09-14 13:20:00 +02:00
Tom-Oliver Heidel acdb1a4ec6 Merge branch 'arbitrary-merges' of https://github.com/fstirlitz/youtube-dlc 2020-09-14 13:04:24 +02:00
Unknown 1985f657e5 Merge branch 'ytdl-org-master' 2020-09-14 12:54:54 +02:00
felix d03cfdce1b Support arbitrary stream merges
With this change, the merge operator may join any number of media streams,
video or audio. The streams are downloaded in the order specified.

Also, fix the metadata post-processor so that it doesn't leave out
any streams.
2020-09-14 12:53:09 +02:00
Unknown e69dd78090 merge ytdl-master 2020-09-14 12:50:56 +02:00
Tom-Oliver Heidel 0e0b56a290
Merge pull request #105 from JensTimmerman/patch-3
Update README.md
2020-09-14 07:20:25 +02:00
Sergey M․ e8c5d40bc8
release 2020.09.14 2020-09-14 03:37:36 +07:00
Sergey M․ ca7ebc4e5e
[ChangeLog] Actualize
[ci skip]
2020-09-14 03:35:18 +07:00
Sergey M․ bff857a8af
[postprocessor/embedthumbnail] Fix issues (closes #25717)
* Fix WebP with wrong extension processing
* Fix embedding of thumbnails with % character in path
2020-09-14 03:28:31 +07:00
Alex Merkel a31a022efd
[postprocessor/embedthumbnail] Add support for non jpeg/png thumbnails (closes #25687) 2020-09-14 03:10:01 +07:00
Jens Timmerman 893afc2ca8
Update README.md 2020-09-13 17:39:53 +02:00
Sergey M․ 45f6362464
[rtlnl] Extend _VALID_URL for new embed URL schema 2020-09-13 21:42:06 +07:00
Derek Land 97f34a48d7
[rtlnl] Extend _VALID_URL (#26549) (closes #25821) 2020-09-13 21:38:16 +07:00
Daniel Peukert ea74e00b3a
[youtube] Fix empty description extraction (#26575) (closes #26006) 2020-09-13 21:23:21 +07:00
Sergey M․ 06cd4cdb25
[srgssr] Extend _VALID_URL (closes #26555, closes #26556, closes #26578) 2020-09-13 21:07:25 +07:00
Sergey M․ da2069fb22
[googledrive] Use redirect URLs for source format (closes #18877, closes #23919, closes #24689, closes #26565) 2020-09-13 20:49:32 +07:00
Tom-Oliver Heidel 3796554609
Merge pull request #102 from blackjack4494/gdcvault-fix
[gdcvault] fix extractor
2020-09-13 15:22:33 +02:00
Unknown 4b819d1454 flake8 2020-09-13 15:21:59 +02:00
Unknown 10bbf2c48d [skip travis] bump version 2020-09-13 15:16:38 +02:00
Sergey M․ 95c9810015
[svtplay] Fix id extraction (closes #26576) 2020-09-13 18:59:37 +07:00
Tom-Oliver Heidel acd0a71894
Merge pull request #96 from blackjack4494/kakao-new-apis
[kakao] new apis
2020-09-13 12:47:12 +02:00
Tom-Oliver Heidel d8edca1e56
Merge pull request #95 from blackjack4494/daserste-ndr
[ndr] support Daserste ndr
2020-09-13 12:36:17 +02:00
Unknown 4cd3053c4f [kakao] new apis
there are also ageLimit and GeoBlock attributes provided by api_json if needed
2020-09-13 12:31:36 +02:00
Remita Amine b03eebdb6a [redbulltv] improve support for rebull.com TV localized URLS(#22063) 2020-09-13 11:26:11 +01:00
Tom-Oliver Heidel ef0389f426
Merge pull request #94 from blackjack4494/conv_subs_when_skipped
[youtube] Convert subs when download is skipped
2020-09-13 11:17:11 +02:00
Unknown f791b41970 flake8 2020-09-13 11:08:02 +02:00
Unknown 57df9f53e0 [youtube] convert subtitles with --skip-download 2020-09-13 11:03:48 +02:00
Unknown 84da5d84b0 add json3 back but don't use it as default. 2020-09-13 11:02:07 +02:00
Unknown fc0999c695 exclude json3 for now 2020-09-13 09:22:24 +02:00
Unknown d23f9ec7eb [gdcvault] fix extractor
at least when not logged in?
2020-09-13 01:04:02 +02:00
Tom-Oliver Heidel dc6193cb22
[skip travis] create faq.md
how to redirect to another extractor
2020-09-13 00:47:45 +02:00
Remita Amine 1f7675451c [redbulltv] Add support for new redbull.com TV URLs(closes #22037)(closes #22063) 2020-09-12 19:27:58 +01:00
felix de7c27cd25 [dispeak] mark the "slides" stream as video-only 2020-09-12 13:41:23 +02:00
tfvlrue aa27253556
[soundcloud] Reduce pagination limit to fix 502 Bad Gateway errors when listing a user's tracks. (#26557)
Per the documentation here https://developers.soundcloud.com/blog/offset-pagination-deprecated the maximum limit is 200, so let's respect that (even if a higher value sometimes works).

Co-authored-by: tfvlrue <tfvlrue>
2020-09-12 09:35:11 +00:00
Unknown e02cdb31bb [skip travis] update supportedsites and make_win.bat 2020-09-12 08:22:59 +02:00
Unknown ac4df0908a [skip travis] bump version 2020-09-12 06:17:02 +02:00
Tom-Oliver Heidel b4d85aad9c Merge branch 'skyme5-tiktok' 2020-09-12 05:51:27 +02:00
Tom-Oliver Heidel 6cef08ad00 Merge branch 'tiktok' of https://github.com/skyme5/youtube-dl into skyme5-tiktok 2020-09-12 05:49:52 +02:00