AI noob - your thoughts on models

Apologies if I have a bad tag here, I can change it.

Thus far I have “dabbled” with Stable-Diffusion, whisper, and deepspeech. Opening up a thread for idea generation on my side, or what are additional models I can add to my tool box. What references have you referred to/read to learn. Etc etc. I feel like AI is peeling away at an onion and for a new entrant there is just too much information.

For example one item I never understood. How to people generate photos based on TV characters (whether real or animated.) Does that come down to training? As it stands now, text to image interests me the most based on the amazing things I’ve seen online.

Aside from y’alls thoughts from above I need to explore a way for character recognition in a video - capture a screenshot - then perhaps get a large enough sample for training but that may be worth its own thread

Please, talk about yourself, what you’ve done and your experiences have been. Here to listen (read).

Best,

WWED

PS:
Providing my hardware below in the event there is a chance there is a hardware constraint.

11600k
3080Ti
32GB Ram

If you get a bunch of images of a TV character you can do things like detection or transfer style to generate images of them.

I’d bet there’s a video or tutorial already if you search around.

1 Like

Youtube is really good.
This guy is great if you want quick demos and references to papers

This guy is like the Gamers Nexus/buildzoid of ML papers. He goes deep and explains AI research

This guy teaches you how to program and make your own neural nets

Data Science/ML Blogs and newsletters are great places to get info

This forum also has some people exploring AI

https://forum.level1techs.com/t/chatgpt-the-good-the-bad-the-experience-so-far/194122?u=bedhedd

Tl;dr large labeled datasets of examples. You’ll notice that for some prompts stable diffusion doesn’t know/isn’t able to generate viable images. @rv6502 was posting some of their stable diffusion generations on the L1 community discord

1 Like

I cannot recommend enough https://course.fast.ai/ if you really want to understand every part of what is deep learning/Machine learning. This is a proper course .

Concerning generating images based on character (real or fictitious) you might want to read up on textual inversion, hypernetwork and LoRA . Automatic1111’s documentation is slightly better then before.

In term of real world application i wrote a small keras based REST end point that i can just push images and get enumeration of items in them (very simple yet useful tool for investigation) and an auto transcriber based on whisper .

PS: sorry for almost necro’ing this topic