Miaoshouai

Models by this creator

📉

Florence-2-base-PromptGen-v1.5

Florence-2-base-PromptGen is an advanced image captioning tool based on the Microsoft Florence-2 Model Base and fine-tuned by MiaoshouAI. It is trained on images and cleaned tags from Civitai to improve the tagging experience and accuracy of prompts used to generate these images. The model is a significant upgrade from previous versions, adding new caption instructions like and while improving accuracy. Model inputs and outputs Inputs Image**: An image to be captioned Outputs Detailed captions**: Descriptions of the image in varying levels of detail, including subject positions and text from the image Image tags**: Structured tags and prompts that can be used to recreate the image Capabilities Florence-2-base-PromptGen excels at generating high-quality, detailed image captions and tags. It can provide very granular descriptions of an image's contents, down to the positions of subjects and text within the frame. The model is also lightweight and memory-efficient, allowing for fast generation on modest hardware. What can I use it for? Florence-2-base-PromptGen is an ideal tool for improving the tagging and prompting workflow when training image generation models like those in the Flux ecosystem. It can eliminate the need to run separate tagging tools, boosting speed and efficiency. The model's detailed captions and tags can also be useful for other applications like visual search, image organization, and data annotation. Things to try Try experimenting with the different caption instructions like and to see how the level of detail in the output changes. You can also test the model's ability to read and incorporate text from the image into the captions. Finally, see how the generated tags and prompts perform when used to recreate the original image with a Flux-based generation model.

Updated 9/16/2024

Text-to-Text