I agree, I suspect the HuggingFace dataset that I've used is not that randomly distributed and it mostly contains prompts related with those themes. How it works is that I randomly select 5 random prompts from the dataset and use those prompts as seed for new prompts. The complete DeepSeek prompt to generate new prompts can be found here: https://github.com/harisec/llm-dreams
All images in the LLM Dreams gallery are generated using the Flux model from the Black Forest Team, powered by the fal.ai API. The prompts for these images were brainstormed by DeepSeek, drawing inspiration from existing Stable Diffusion prompts found in the Gustavosta/Stable-Diffusion-Prompts dataset on Hugging Face. All the code was written by Claude.
Researchers from Carnegie Mellon University found that it's possible to automatically construct adversarial attacks on LLMs, forcing them to answer any questions and it's possible to generated unlimited number of such attacks, making them very hard to protect against.