1 of 26

VISUAL NEURAL NETWORKS

Time to have some fun!

2 of 26

SERGEY SHIMA

  • TDI Group Central Asia
  • Halva, Bonstiki
  • in marketing since 2005
  • working with neural networks since 2017

@aimastersme

a channel about applying AI in practice

3 of 26

where to find tools

theresanaiforthat.com

futuretools.io

4 of 26

useful resources

https://www.aiforwork.co/ - a collection of prompts for work tasks

https://www.youtube.com/watch?v=wjZofJX0v4M - a detailed look at how GPT technology works

https://www.youtube.com/watch?v=yPTLhE_isr0 - engaging popular science about the AI revolution

https://academy.bothub.chat/ - a good text-based course on ChatGPT

https://www.perplexity.ai/ - ask where to watch or read material on a topic you are interested in

5 of 26

how visual models are trained

green

dog

terrier

brown nose

brown eyes

green background

illustrations

6 of 26

7 of 26

fonts

controllability

photorealism

beauty

Stable Diffusion

Midjourney

Flux

Playground

Ideogram

8 of 26

An all-rounder. Great for advertising images, backgrounds, creative key visuals, storyboards, and style copying.

A set of models for different tasks. Steep learning curve and hard to control, but you can achieve almost anything you have in mind.

An evolution of Stable Diffusion. Follows the prompt well. Has add-ons that let it learn what new objects and styles look like.

Follows commands very well. Monotonous, recognizable style. Suitable only for concepting.

Controlled with text. Uses different models to generate images. Great for creating concepts.

For working with text and typography. The new 2.0 model follows the prompt very well and produces fairly realistic images. Works great with fonts.

features

9 of 26

MIDJOURNEY

STABLE DIFFUSION

(FLUX)

DALL-E 3

10 of 26

11 of 26

MIDJOURNEY

November 2022

November 2023

12 of 26

WARNING! DANGER!!!

generating an image to a brief takes many attempts and eats up time

13 of 26

PROMPT STRUCTURE

medium, subject, environment, style, details, colors, lighting, camera angle, camera

black and white photograph of an innocent child, emotionally charged scene, in the style of hyperrealism and photorealism, dark amber, close-up portrait, photo taken with Hasselblad x2d 90V f2.5 --style raw --s 80 --ar 4:3 --r 3
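The structure above can be sketched as a small helper that joins the components in the listed order. This is only an illustration: the `ImagePrompt` class and its field names are hypothetical and not part of Midjourney, and the flags are copied verbatim from the example prompt.

```python
from dataclasses import dataclass


@dataclass
class ImagePrompt:
    """Hypothetical container for the prompt components listed on the slide."""
    medium: str = ""
    subject: str = ""
    environment: str = ""
    style: str = ""
    details: str = ""
    colors: str = ""
    lighting: str = ""
    angle: str = ""
    camera: str = ""
    flags: str = ""  # Midjourney parameters, appended verbatim at the end

    def render(self) -> str:
        # Medium and subject read naturally as one phrase: "<medium> of <subject>".
        if self.medium and self.subject:
            head = f"{self.medium} of {self.subject}"
        else:
            head = self.medium or self.subject
        parts = [head, self.environment, self.style, self.details,
                 self.colors, self.lighting, self.angle, self.camera]
        # Skip empty components so the prompt has no dangling commas.
        text = ", ".join(p for p in parts if p)
        return f"{text} {self.flags}".strip()


prompt = ImagePrompt(
    medium="black and white photograph",
    subject="an innocent child",
    environment="emotionally charged scene",
    style="in the style of hyperrealism and photorealism",
    colors="dark amber",
    angle="close-up portrait",
    camera="photo taken with Hasselblad x2d 90V f2.5",
    flags="--style raw --s 80 --ar 4:3 --r 3",
)
print(prompt.render())
```

Rendering this instance reproduces the example prompt from the slide. Keeping each component in its own field makes it easy to swap, say, only the lighting between attempts.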

14 of 26

CAMERA ANGLES

  1. eye level shot
  2. low angle shot
  3. high angle shot
  4. hip level shot
  5. knee level shot
  6. ground level shot
  7. shoulder level shot
  8. dutch angle shot (oblique angle)
  9. overhead shot
  10. aerial shot
  11. macro
  12. close-up
  13. wide angle
  14. diagonal angle
  15. bird's eye view
  16. worm's eye view
  17. extreme close-up
  18. medium shot
  19. long shot
  20. extreme long shot
  21. two shot (two subjects in the frame)
  22. point of view (pov) shot
  23. over-the-shoulder shot

LIGHTING

  1. golden hour
  2. dappled light
  3. backlight
  4. overcast light
  5. direct sunlight
  6. moonlight
  7. starlight
  8. twilight
  9. harsh midday light
  10. crepuscular rays
  11. chiaroscuro
  12. silhouette
  13. low key lighting
  14. photogram
  15. juxtaposition of light and shadow
  16. overexposed
  17. film noir
  18. double exposure
  19. HDR
  20. distortion
  21. lens flares
  22. light leaks
  23. light trails
  24. bioluminescence
  25. light beams
  26. light painting
  27. bokeh
  28. sparkles
  29. chromatic aberration
  30. laser beams
  31. studio lighting
  32. split lighting
  33. cinematic lighting
  34. volumetric lighting
  35. rim lighting
  36. [color] lighting
  37. [color] and [color] lighting
  38. stage lighting
  39. neon lights
  40. spotlight

15 of 26

WHAT IF IT DOESN'T LISTEN?

Describe the detail that must be in the frame

full body shot of a blonde girl, in the style of street photography, golden hour

full body shot of a blonde girl wearing white sneakers, in the style of street photography, golden hour

16 of 26

NO MORE THAN 6 DETAILS

Street photograph of a girl with pink hair in a striped tank top and denim shorts, standing in a phone booth. Her lips are painted green, she wears gold bracelets on her wrists, holds a large pink lollipop

Street photograph of a girl with pink hair in a striped tank top and denim shorts, standing in a phone booth. Her lips are painted green, she wears gold bracelets on her wrists, holds a large pink lollipop, and is wearing red sneakers
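One rough way to stay within the limit is to keep the details as an explicit list and check its length before assembling the prompt. A minimal sketch, assuming a hypothetical `compose` helper; the limit of 6 comes from the slide, while what counts as a single detail is a judgment call.

```python
MAX_DETAILS = 6  # limit taken from the slide


def compose(base: str, details: list[str], limit: int = MAX_DETAILS) -> str:
    """Join a base description with its details, refusing to exceed the limit."""
    if len(details) > limit:
        extra = ", ".join(details[limit:])
        raise ValueError(
            f"{len(details)} details exceed the limit of {limit}; consider dropping: {extra}"
        )
    return f"{base}, {', '.join(details)}" if details else base


details = [
    "pink hair",
    "striped tank top",
    "denim shorts",
    "lips painted green",
    "gold bracelets on her wrists",
    "holding a large pink lollipop",
]
print(compose("Street photograph of a girl standing in a phone booth", details))

# Adding a seventh detail pushes the prompt past the limit.
try:
    compose("Street photograph of a girl standing in a phone booth",
            details + ["wearing red sneakers"])
except ValueError as err:
    print(err)
```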

17 of 26

NAME THE CHARACTERS

Anya is a young woman wearing a white tank top, blue jeans, and red sneakers. Vanya is a man with a beard dressed in a black leather jacket and black leather pants.

Anya and Vanya are walking on a night city street. In the style of candid shot photography, soft neon lighting.

18 of 26

REALISM

shot on mobile phone

posted to snapchat

posted to reddit

--style raw

--s 70

photo ID, average 32 years old belarus {male, female}, imperfections, hair is neat, white background --style raw --ar 3:4 --s 65
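The `{male, female}` group in the prompt above appears to be Midjourney's permutation syntax, where one prompt is submitted once per option. To preview which prompts would result, the expansion can be approximated locally. The `expand` function below is a hypothetical helper, not Midjourney's own implementation, and it ignores nested groups and escaped commas.

```python
import itertools
import re

# Matches one non-nested {option, option, ...} group.
GROUP = re.compile(r"\{([^{}]*)\}")


def expand(prompt: str) -> list[str]:
    """Expand every {a, b, ...} group into one prompt per combination."""
    pieces = GROUP.split(prompt)  # literal, options, literal, options, ...
    literals = pieces[0::2]
    option_sets = [[o.strip() for o in p.split(",")] for p in pieces[1::2]]
    results = []
    for combo in itertools.product(*option_sets):
        out = literals[0]
        # Interleave the chosen options with the literal text between groups.
        for choice, literal in zip(combo, literals[1:]):
            out += choice + literal
        results.append(out)
    return results


source = ("photo ID, average 32 years old belarus {male, female}, imperfections, "
          "hair is neat, white background --style raw --ar 3:4 --s 65")
for variant in expand(source):
    print(variant)
```

With one two-option group this prints two prompts. Every additional group multiplies the number of prompts, which is worth keeping in mind given how quickly generation attempts add up.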

19 of 26

PHOTO SHOOT WITH A CONSISTENT CHARACTER

20 of 26

PARAMETERS

--c 20

chaos adds unpredictability

--r 4

repeat the job several times

--sref random

applies a random style

--s 100

stylization, from 0 to 1000

--style raw

no added processing

--ar 2:3

aspect ratio

To get really good at controlling Midjourney, read the documentation. It is very simple and clear, with illustrative examples.
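The parameters above can be assembled into a prompt suffix programmatically. The sketch below is an assumption-laden illustration: `format_params` and its keyword names are hypothetical, only the 0 to 1000 stylization range is taken from the slide, and the other values are passed through unchecked, so the Midjourney documentation remains the authority on valid ranges.

```python
import re
from typing import Optional


def format_params(*, chaos: Optional[int] = None, repeat: Optional[int] = None,
                  sref: Optional[str] = None, stylize: Optional[int] = None,
                  raw: bool = False, aspect: Optional[str] = None) -> str:
    """Render the parameters from the slide as a prompt suffix."""
    # Range stated on the slide: stylization runs from 0 to 1000.
    if stylize is not None and not 0 <= stylize <= 1000:
        raise ValueError("stylize must be between 0 and 1000")
    # Aspect ratio is written as width:height, for example 2:3.
    if aspect is not None and not re.fullmatch(r"\d+:\d+", aspect):
        raise ValueError("aspect must look like 2:3")
    parts = []
    if chaos is not None:
        parts.append(f"--c {chaos}")
    if repeat is not None:
        parts.append(f"--r {repeat}")
    if sref is not None:
        parts.append(f"--sref {sref}")
    if stylize is not None:
        parts.append(f"--s {stylize}")
    if raw:
        parts.append("--style raw")
    if aspect is not None:
        parts.append(f"--ar {aspect}")
    return " ".join(parts)


suffix = format_params(chaos=20, repeat=4, sref="random",
                       stylize=100, raw=True, aspect="2:3")
print(suffix)  # --c 20 --r 4 --sref random --s 100 --style raw --ar 2:3
```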

21 of 26

DESCRIBING AN IMAGE FROM A REFERENCE

CREATIVE DESIGNER

EXECUTIVE DIRECTOR

/describe

Describe this image precisely.

Midjourney

ChatGPT

22 of 26

CHATGPT (CLAUDE, GEMINI) PROMPT FOR A DETAILED DESCRIPTION OF A REFERENCE IMAGE

##Follow these rules to describe the image:##

**Formatting Rule:** Begin directly with the description, following each rule step-by-step. Do not mention rule names, terms like "medium," "object," "camera angles," or any unnecessary text. Keep the description under 1500 characters. Focus strictly on observable details, avoiding assumptions or storytelling beyond what is visible.

Start by describing the medium: Identify the medium or type of artwork first, such as "photograph," "street photography," "oil painting," "knolling," "editorial photo," or any other appropriate category that matches the input image.

Be specific or vague; clarify context and details: Determine the main theme or narrative of the image. Highlight details that draw attention or set the overall context of the scene, helping to frame the perception of the artwork.

Use visually well-defined objects: Clearly identify objects in the scene, such as "wizard," "priest," "angel," "emperor," "necromancer," "rockstar," "city," "queen," "Zeus," "house," "temple," "farm," "car," "landscape," "mountain," or "river." Include specific numbers when relevant, like "three cyberpunk wizards" or "two mountains." Describe poses, actions, expressions, and relationships between objects.

Convey strong feelings or themes: Capture the atmosphere or emotions conveyed by the image, such as "sense of awe," "will to endure," "cognitive resonance," "shores of infinity," "birth of time," "desire for knowledge," or "notion of self." Emphasize the primary emotional or thematic elements of the scene.

Describe the visual style of the image: Mention specific visual styles, such as "cyberpunk wizard," "surreal landscape," "psychedelic astronaut," and more. These examples provide a direction for the visual interpretation.

Indicate the artistic style: Define the broader artistic styles seen in the image, such as "cyberpunk," "psychedelic," "surreal," "vaporwave," "alien," "solarpunk," "modern," "ancient," "futuristic," "retro," "realistic," "dreamlike," "funk art," "abstract," "pop art," "impressionism," or "minimalism."

Mention unique artists or their combinations: If the style resembles the work of specific artists or combinations, include this in your description. For example, "a temple by James Gurney," "a figure inspired by M.C. Escher," or "a blend of Greg Rutkowski and Ross Tran."

Include the technique or medium: Identify the technique or medium used, such as "watercolor landscape," "child's drawing of a home," "sculpture," "graffiti," "oil on canvas," "pencil drawing," "charcoal drawing," "ink drawing," "matte painting," "fresco," "stone tablet," or "cave painting." This detail helps define the texture and material aspects of the image.

Specify the lighting techniques: Describe the lighting, such as "golden hour," "dappled light," "backlight," "overcast light," "direct sunlight," "moonlight," "starlight," "twilight," "harsh midday light," "chiaroscuro," "silhouette," "studio lighting," "neon lights," "cinematic lighting," "rim lighting," "volumetric lighting," or any combination that best represents the scene.

Identify the camera angle: Include the camera angle used in the image, such as "eye level shot," "low angle shot," "high angle shot," "aerial shot," "macro," "close-up," "wide angle," "overhead shot," "worm's eye view," "extreme close-up," "medium shot," "long shot," "point of view (POV) shot," or "over-the-shoulder shot." This provides a sense of perspective and how the viewer engages with the scene.

Use positive descriptions and avoid negatives: Clearly describe what is present, such as "blue hat," "half-person, half-robot," or "a psychedelic astronaut crew." Focus on positive and concrete elements rather than absent or negative aspects.

Specify details clearly: Be precise with counts and specific objects, for example, "three monkeys in business suits" or "a temple under a starry sky."

Use singular nouns or specific numbers: Be specific with counts, avoiding vague terms. Examples include "three cyberpunk wizards," "a psychedelic astronaut crew," or "a solarpunk city with holograms and futuristic glowing decorations."

Avoid significant extrapolation: Focus on what is directly visible without assuming hidden elements or extensive storytelling beyond what the image shows. Stick to what can be observed clearly.

Provide an example of the final description: For example, "A 3D digital art of three cyberpunk wizards standing on the shores of an infinite river under a sunset sky. The style combines elements of vaporwave and surrealism, executed in watercolor technique. The atmosphere conveys a sense of awe and a desire for knowledge, reminiscent of works by Salvador Dali and Greg Rutkowski. The lighting is warm, resembling the golden hour, and the camera angle is a low shot, adding drama and presence to the figures."

23 of 26

TOP TOOLS: GRAPHICS

  1. Midjourney – the best image generator
  2. Freepik – the best set of high-quality tools for designers
  3. ChatGPT – concept creation and text-based control of DALL-E 3
  4. Microsoft Copilot – a free generator based on DALL-E
  5. Ideogram – realistic visuals with text
  6. Krea – a fast, easy-to-control service based on Flux, plus stylization and upscaling
  7. Playground – an image generator and editor driven by text commands
  8. Recraft – a handy service for vector graphics and mockups
  9. Dzine – image stylization
  10. Adobe Firefly – an image generator and editor from Adobe
  11. ClipDrop – an image editor for non-designers
  12. TinyWow – a set of utilities for image editing and file conversion
  13. Illusion Diffusion – a tool for blending an object's shape into its surroundings
  14. DeepSwap – a tool for swapping faces in photos or video
  15. InsightFaceSwap – high-quality face swapping in photos (a Discord bot)
  16. Fal.ai – training your own models (LoRA) on top of Flux
  17. Flux-pulid – creating a look-alike image from a single photo
  18. FastFlux – a very fast, free image generator based on Flux Schnell
  19. Kolors – virtual try-on
  20. LivePortrait – animating static portraits

24 of 26

TOP TOOLS: ANIMATION AND VIDEO

  1. Pika – a free Discord bot for creating simple animation
  2. PixVerse – a free Discord bot for animating pictures
  3. Viggle – character animation from a reference video
  4. Luma Dream Machine – a realistic video generator from text and images
  5. Kling – a video generator from text and frames
  6. RunwayML Gen-3 – a cinematic video generator
  7. Minimax – a very high-quality Chinese text-to-video generator

A universal prompt structure for a video generator:

Subject (description of the subject). Subject motion. Scene (description of the scene and surroundings). Camera angle and camera movement. Lighting. Overall atmosphere and mood
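The video prompt structure above is a fixed sequence of short sentences, which can be sketched as follows. The `video_prompt` helper and its field names are hypothetical, and the fox scene is a made-up example rather than a prompt from the talk.

```python
# Field order follows the structure on the slide.
VIDEO_FIELDS = ("subject", "motion", "scene", "camera", "lighting", "mood")


def video_prompt(**parts: str) -> str:
    """Join the given parts as sentences, always in the slide's order."""
    unknown = set(parts) - set(VIDEO_FIELDS)
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    # Drop empty parts and trailing periods so sentences join cleanly.
    sentences = [
        parts[name].strip().rstrip(".")
        for name in VIDEO_FIELDS
        if parts.get(name, "").strip()
    ]
    return ". ".join(sentences) + "." if sentences else ""


print(video_prompt(
    subject="A red fox with a thick winter coat",
    motion="The fox trots across fresh snow",
    scene="A birch forest at dawn",
    camera="Low angle tracking shot",
    lighting="Golden hour, volumetric lighting",
    mood="Calm and quiet",
))
```

Because the order is fixed inside the helper, the parts can be passed in any order and still come out in the sequence the slide recommends.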

25 of 26

TOP TOOLS: AUDIO

  1. ElevenLabs – voice-over, voice cloning, audiobook narration
  2. Suno – the most feature-rich music and song generator (supports Russian)
  3. Udio – an alternative to Suno with higher-quality sound, but slightly harder to control

How to write a song:

Ask Claude Sonnet 3.5 or Claude Opus to write lyrics or a poem on a given theme and in a given style. Paste the resulting text into Suno and describe the style.

26 of 26

Thank you and good luck!

@aimastersme

a channel about applying AI in practice