Comic strip generator

A single-page web application with a FastAPI backend.

Uses DALL-E (v2 and v3) and GPT-4.

Generating three images that compose a coherent whole is a problem with DALL-E: it cannot be done with three separate requests, because DALL-E does not itself keep context between them, and the context that can be artificially created using GPT Vision is very rudimentary. Not to worry. We could probably create a three-panel strip in a single image at a smaller resolution, but what's the fun in that? Instead, we generate four panels arranged in a 2x2 grid in the native square aspect ratio, at the highest resolution, then chop the image and rearrange the sub-images into a strip. This way, each panel spans half the image width instead of a third, so we get 50% higher resolution! (The fourth panel is generated with a synthetic prompt, and promptly discarded.)
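The chop-and-rearrange step can be sketched roughly as follows. This is a minimal illustration, not the code in app.py: the function names `panel_boxes` and `make_strip` are hypothetical, Pillow is assumed as the imaging library, and the discarded fourth panel is assumed to be the bottom-right one.

```python
from typing import List, Tuple

# (left, upper, right, lower), the box format expected by Pillow's Image.crop
Box = Tuple[int, int, int, int]


def panel_boxes(size: int) -> List[Box]:
    """Crop boxes for a 2x2 grid in a size x size image, in reading order."""
    half = size // 2
    return [
        (0, 0, half, half),        # panel 1: top-left
        (half, 0, size, half),     # panel 2: top-right
        (0, half, half, size),     # panel 3: bottom-left
        (half, half, size, size),  # panel 4: bottom-right (assumed to be the discarded one)
    ]


def make_strip(grid_image, keep: int = 3):
    """Cut the first `keep` panels out of a square grid image and paste them side by side."""
    from PIL import Image  # Pillow; imported here so panel_boxes() works without it

    size = grid_image.width
    half = size // 2
    strip = Image.new(grid_image.mode, (half * keep, half))
    for i, box in enumerate(panel_boxes(size)[:keep]):
        strip.paste(grid_image.crop(box), (i * half, 0))
    return strip
```

Assuming a 1024x1024 source image, each panel comes out at 512x512 and the strip at 1536x512. Three panels placed side by side in the same square image would be about 341 pixels wide each, which is where the 50% figure comes from.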

Sometimes the pictures do not quite match the prompts. I'm planning to add a reflection step using GPT Vision, and re-generate when appropriate. Until then, keep changing the prompts until you are happy with the result.
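One possible shape for such a reflection step is a bounded retry loop. The sketch below is purely hypothetical, since the feature is only planned: `generate` and `matches_prompt` are placeholder callables standing in for the image request and the GPT Vision check, and `max_attempts` is an assumed cap to keep API usage bounded.

```python
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def generate_with_reflection(
    prompt: str,
    generate: Callable[[str], T],
    matches_prompt: Callable[[T, str], bool],
    max_attempts: int = 3,
) -> Optional[T]:
    """Call generate() until matches_prompt() accepts the result or attempts run out."""
    result = None
    for _ in range(max_attempts):
        result = generate(prompt)
        if matches_prompt(result, prompt):
            break
    # Returns the last attempt even if it was never accepted,
    # or None when max_attempts is 0.
    return result
```

Passing the two steps in as callables keeps the loop independent of any particular API client, so it can be exercised with stubs.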

You can try it here if you want.



rdancer/comix-generator-demo
