Here is [Gwern Danbooru 2018 dataset](https://www.gwern.net/Danbooru2018) with **2.536.329** Danbooru images
**till 01.01.2019 rating:safe resized to 512x512 px** with some meta-information
used for image recognition training **in zipped format, acceptible to all torrent clients.**
Meta information included in "initial" JSON format and "normalized" 3-tables CSV
(posts with some additional stats, taglist with some additional info, tags occurrences in posts).
There is the [next volume for 2019-2021](https://nyaa.si/view/1482846).
NOTE [BOORU CHARS](https://nyaa.si/view/1206322) - my compilation of 1.227.622 thumbnails (also 512x512px)
for best art images from several sources (only ~360.000 taken from this release)
enriched with much more calculated metadata, including face detected.
Also I develop a BOORU CHAR dataset with 1280px samples
[release 2021](https://nyaa.si/view/1384820) , [release 2015](https://nyaa.si/view/1468367), [release 2022](https://nyaa.si/view/1547662), [release 2023](https://nyaa.si/view/1740396)
and 2560/2480/1920px [release 2024](https://nyaa.si/view/1927862), to be continued.
Comments - 2
Astral
SomaHeir