
video2scenario

First, the input video is split into a tree-like folder dataset with L parts at the lowest level, as sketched below.
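A minimal sketch of how such a tree could be built with ffmpeg, assuming a fixed branching factor and a `part_i` folder layout; the helper name `split_video` and the exact layout are illustrative, not the repository's actual code:

```python
import subprocess
from pathlib import Path

def split_video(src: Path, out_dir: Path, parts: int, depth: int) -> None:
    """Recursively split `src` into `parts` equal clips, `depth` levels deep."""
    if depth == 0:
        return
    # Probe the clip duration in seconds with ffprobe.
    duration = float(subprocess.check_output(
        ["ffprobe", "-v", "error", "-show_entries", "format=duration",
         "-of", "default=noprint_wrappers=1:nokey=1", str(src)]))
    chunk = duration / parts
    for i in range(parts):
        part_dir = out_dir / f"part_{i}"
        part_dir.mkdir(parents=True, exist_ok=True)
        clip = part_dir / "clip.mp4"
        # Stream-copy one chunk; -ss before -i seeks to the nearest keyframe.
        subprocess.run(
            ["ffmpeg", "-y", "-ss", str(i * chunk), "-t", str(chunk),
             "-i", str(src), "-c", "copy", str(clip)], check=True)
        split_video(clip, part_dir, parts, depth - 1)
```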

Then the tool recursively writes descriptions of the scenes with Large Language Models and Image Captioning Models.

The lowest-level clips are captioned with VideoLLaVA.
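A captioning sketch following the Hugging Face `transformers` Video-LLaVA integration (the repo may load and prompt the model differently); the 8-frame sampling and the prompt wording are assumptions:

```python
import av
import numpy as np
from transformers import VideoLlavaProcessor, VideoLlavaForConditionalGeneration

MODEL_ID = "LanguageBind/Video-LLaVA-7B-hf"
processor = VideoLlavaProcessor.from_pretrained(MODEL_ID)
# device_map="auto" requires the accelerate package.
model = VideoLlavaForConditionalGeneration.from_pretrained(MODEL_ID, device_map="auto")

def sample_frames(path: str, n: int = 8) -> np.ndarray:
    """Uniformly sample n RGB frames from a video file."""
    container = av.open(path)
    total = container.streams.video[0].frames
    wanted = set(np.linspace(0, total - 1, n).astype(int).tolist())
    frames = [f.to_ndarray(format="rgb24")
              for i, f in enumerate(container.decode(video=0)) if i in wanted]
    return np.stack(frames)

prompt = "USER: <video>\nDescribe what happens in this clip. ASSISTANT:"
inputs = processor(text=prompt, videos=sample_frames("clip.mp4"),
                   return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=120)
print(processor.batch_decode(out, skip_special_tokens=True)[0])
```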

The descriptions are gathered into a list, and the LLM is asked to describe the overall scene. The process then continues until the top level is reached.

Any OpenAI-like text completion model can be used for this step. In my tests, Oobabooga's text-generation-webui serves as the API endpoint.
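One summarization step might look like the sketch below, posting to an OpenAI-compatible completion endpoint (text-generation-webui exposes one when started with its API enabled); the URL, port, and prompt wording here are assumptions:

```python
import requests

ENDPOINT = "http://127.0.0.1:5000/v1/completions"  # assumed local webui address

def summarize(child_descriptions: list[str]) -> str:
    """Ask the LLM to merge the children's descriptions into one scene description."""
    prompt = ("The following are descriptions of consecutive sub-clips of one scene:\n"
              + "\n".join(f"{i + 1}. {d}" for i, d in enumerate(child_descriptions))
              + "\nDescribe the overall scene in one paragraph:")
    resp = requests.post(ENDPOINT, json={"prompt": prompt,
                                         "max_tokens": 200,
                                         "temperature": 0.7},
                         timeout=120)
    resp.raise_for_status()
    return resp.json()["choices"][0]["text"].strip()
```

Applied bottom-up over the folder tree, this yields one description per node until the root is reached.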

Users can also provide a master prompt to guide the model and edit the resulting descriptions in a Gradio demo interface.

There is also an option to store the corrected output for further fine-tuning of the models, for example with a LoRA.
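For example, each human correction could be appended to a JSONL file of (prompt, raw output, corrected output) records; the file name and field names below are illustrative:

```python
import json

def log_correction(prompt: str, raw: str, corrected: str,
                   path: str = "corrections.jsonl") -> None:
    """Append one training record for later LoRA fine-tuning."""
    record = {"prompt": prompt, "raw": raw, "corrected": corrected}
    with open(path, "a", encoding="utf-8") as f:
        f.write(json.dumps(record, ensure_ascii=False) + "\n")
```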

The Gradio interface has a dropdown to select each description/clip pair at each level; a minimal sketch follows.
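This sketch assumes a flat dict that maps "level/part" keys to (clip path, description) pairs; the real demo likely exposes more controls:

```python
import gradio as gr

# Hypothetical in-memory store: "level/part" -> (clip path, description).
pairs = {"level1/part_0": ("dataset/part_0/clip.mp4", "A man walks into a room.")}

def load(key):
    clip, text = pairs[key]
    return clip, text

def save(key, text):
    clip, _ = pairs[key]
    pairs[key] = (clip, text)
    return f"Saved {key}"

with gr.Blocks() as demo:
    key = gr.Dropdown(choices=list(pairs), label="Description/clip pair")
    clip = gr.Video()
    text = gr.Textbox(label="Description", lines=4)
    status = gr.Markdown()
    key.change(load, key, [clip, text])
    gr.Button("Save correction").click(save, [key, text], status)

demo.launch()
```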


The goal of this subproject is to build a DiffusionOverDiffusion dataset to train InfiNet and future complex script-based text2video models with minimal human labeling effort.
