Previously I worked on a project called Spoken Wardrobe. The concept was a camera display that acts like a mirror: the user speaks to their reflection on the screen, and whatever they say is used as a prompt for Stable Diffusion, which generates an AI inpainted image of them wearing whatever they asked for.
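To make the speech-to-prompt step concrete, here is a minimal sketch of how a raw transcript might be turned into an inpainting prompt. This is not the original Spoken Wardrobe code: the function name `buildInpaintingPrompt`, the filler-word list, the request phrases, and the "a person wearing ..." template are all assumptions made for illustration, and the cleanup only handles English text.

```typescript
// Hypothetical helper, not taken from the original project.
// Filler words to drop from the spoken request (assumed list).
const FILLER_WORDS = new Set(["um", "uh", "er", "please"]);

// Phrases a speaker might say before naming the outfit (assumed list).
const REQUEST_PREFIXES = [
  "i want to wear",
  "i would like to wear",
  "i'd like to wear",
  "put me in",
  "dress me in",
];

function buildInpaintingPrompt(transcript: string): string {
  // Lowercase, replace punctuation with spaces (apostrophes and hyphens
  // are kept), then collapse repeated whitespace.
  let text = transcript
    .toLowerCase()
    .replace(/[^a-z0-9'\s-]/g, " ")
    .replace(/\s+/g, " ")
    .trim();

  // Drop filler words.
  text = text
    .split(" ")
    .filter((word) => !FILLER_WORDS.has(word))
    .join(" ");

  // Strip the first matching request phrase so only the outfit remains.
  for (const prefix of REQUEST_PREFIXES) {
    if (text.startsWith(prefix + " ")) {
      text = text.slice(prefix.length + 1);
      break;
    }
  }

  // Nothing usable was said: return an empty prompt so the caller can skip generation.
  if (text === "") {
    return "";
  }
  return `a person wearing ${text}`;
}
```

For example, `buildInpaintingPrompt("Um, I want to wear a red leather jacket!")` returns `"a person wearing a red leather jacket"`. The resulting string would then be sent to the inpainting model together with the camera frame and a mask over the clothing area.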


Why?

This project relied on a proxy API hosted on Glitch, made by Dan O Sullivan. Unfortunately, that API recently stopped working, so I thought I could update my project to use Transformers.js instead.

Whisper


https://hf.co/spaces/Xenova/realtime-whisper-webgpu


Stable Diffusion


Future objectives