ВИЗУАЛЬНЫЕ НЕЙРОНКИ
Time to have some fun!
СЕРГЕЙ ШИМА
@aimastersme
канал о практике применения ИИ
где искать инструменты
theresanaiforthat.com
futuretools.io
полезные ресурсы
https://www.aiforwork.co/ - коллекция промптов для рабочих задач
https://www.youtube.com/watch?v=wjZofJX0v4M - разобраться в деталях, как работает технология GPT
https://www.youtube.com/watch?v=yPTLhE_isr0 - не скучный научпоп об ИИ революции
https://academy.bothub.chat/ - хороший текстовый курс по ChatGPT
https://www.perplexity.ai/ - спросить, где можно посмотреть/почитать материалы по интересующей теме
как обучают визуальные модели
зеленый
собака
терьер
коричневый нос
коричневые глаза
фон зеленый
иллюстраций
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
шрифты
управляемость
фотореализм
красота
Stable Diffusion
MIdjourney
Flux
Playground
Ideogram
Универсальный боец. Отлично подходит для рекламных изображений, фонов, креативных КВ, раскадровки, копирования стиля. |
Набор моделей для разных задач. Сложный вход, сложно управлять, но можно добиться практически всего, чего задумано. |
Эволюция Stable Diffusion. Хорошо слушается промпта. Имеет дополнительные плагины, чтобы обучаться, как выглядят новые объекты и стили. |
Отлично слушается команд. Однообразный узнаваемый стиль. Подходит только для концептуализации. |
Управление текстом. Использует разные модели для генерации изображений. Отлично годится для создания концепций. |
Для работы с текстом и типографикой. Новая модель 2.0 прекрасно слушается промпта и выдаёт достаточно реалистичные изображения. Классно работает со шрифтами. |
особенности
MIDJOURNEY
STABLE DIFFUSION
(FLUX)
DALL-E 3
MIDJOURNEY
2022
2023
november
november
ВНИМАНИЕ!�ОПАСНО!!!
генерация изображения �по ТЗ требует множества попыток и пожирает время
СТРУКТУРА ПРОМПТА
носитель, объект, окружение, стиль, детали, цвета, освещение, ракурс, камера
black and white photograph of an innocent child, emotionally charged scene, in the style of hyperrealism and photorealism, dark amber, close-up portrait, photo taken with Hasselblad x2d 90V f2.5 --style raw --s 80 --ar 4:3 --r 3
РАКУРСЫ
ОСВЕЩЕНИЕ
ЧТО ДЕЛАТЬ, ЕСЛИ НЕ СЛУШАЕТСЯ?
Опишите деталь, которая должна быть в кадре
full body shot of a blonde girl, in the style of street photography, golden hour
full body shot of a blonde girl wearing white sneakers,
in the style of street photography, golden hour
НЕ БОЛЕЕ 6 ДЕТАЛЕЙ
Street photograph of a girl with pink hair in a striped tank top and denim shorts, standing in a phone booth. Her lips are painted green, she wears gold bracelets on her wrists, holds a large pink lollipop
Street photograph of a girl with pink hair in a striped tank top and denim shorts, standing in a phone booth. Her lips are painted green, she wears gold bracelets on her wrists, holds a large pink lollipop, and is wearing red sneakers
НАЗОВИТЕ�ПЕРСОНАЖЕЙ
Anya is a young woman wearing a white tank top, blue jeans, and red sneakers. ��Vanya is man with a beard dressed in a black leather jacket and black leather pants.
Anya and Vanya are walking on a night city street. In the style of candid shot photography, soft neon lighting.
РЕАЛИЗМ
shot on mobile phone
posted to snapchat
posted to reddit
—style raw
—s 70
photo ID, average 32 years old belarus {male, female}, imperfections, hair is neat, white background --style raw --ar 3:4 --s 65
ФОТОСЕССИЯ �С НЕИЗМЕННЫМ ПЕРСОНАЖЕМ
ПАРАМЕТРЫ
--c 20
хаос добавляет непредсказуемости
--r 4
повторить несколько раз
--sref random
применяет случайны стиль
--s 100
стилизация �от 0 до 1000
--style raw
без обработки
--ar 2:3
соотношение сторон
Чтобы научиться лихо управлять Midjourney, почитайте документацию. Она очень простая и понятная, с наглядными примерами.
ОПИСАНИЕ КАРТИНКИ ПО РЕФЕРЕНСУ
CREATIVE DESIGNER
EXECUTIVE DIRECTOR
/describe
Describe this image precisely.
Midjourney
ChatGPT
ПРОМПТ CHATGPT (CLAUDE, GEMINI) �ДЛЯ ДЕТАЛЬНОГО ОПИСАНИЯ РЕФЕРЕНСНОГО ИЗОБРАЖЕНИЯ
##Follow this rules to describe the image:##
**Formatting Rule:** Begin directly with the description, following each rule step-by-step. Do not mention rule names, terms like "medium," "object," "camera angles," or any unnecessary text. Keep the description under 1500 characters. Focus strictly on observable details, avoiding assumptions or storytelling beyond what is visible.
Start by describing the medium: Identify the medium or type of artwork first, such as "photograph," "street photography," "oil painting," "knolling," "editorial photo," or any other appropriate category that matches the input image.
Be specific or vague; clarify context and details: Determine the main theme or narrative of the image. Highlight details that draw attention or set the overall context of the scene, helping to frame the perception of the artwork.
Use visually well-defined objects: Clearly identify objects in the scene, such as "wizard," "priest," "angel," "emperor," "necromancer," "rockstar," "city," "queen," "Zeus," "house," "temple," "farm," "car," "landscape," "mountain," or "river." Include specific numbers when relevant, like "three cyberpunk wizards" or "two mountains." Describe poses, actions, expressions, and relationships between objects.
Convey strong feelings or themes: Capture the atmosphere or emotions conveyed by the image, such as "sense of awe," "will to endure," "cognitive resonance," "shores of infinity," "birth of time," "desire for knowledge," or "notion of self." Emphasize the primary emotional or thematic elements of the scene.
Describe the visual style of the image: Mention specific visual styles, such as "cyberpunk wizard," "surreal landscape," "psychedelic astronaut," and more. These examples provide a direction for the visual interpretation.
Indicate the artistic style: Define the broader artistic styles seen in the image, such as "cyberpunk," "psychedelic," "surreal," "vaporwave," "alien," "solarpunk," "modern," "ancient," "futuristic," "retro," "realistic," "dreamlike," "funk art," "abstract," "pop art," "impressionism," or "minimalism."
Mention unique artists or their combinations: If the style resembles the work of specific artists or combinations, include this in your description. For example, "a temple by James Gurney," "a figure inspired by M.C. Escher," or "a blend of Greg Rutkowski and Ross Tran."
Include the technique or medium: Identify the technique or medium used, such as "watercolor landscape," "child's drawing of a home," "sculpture," "graffiti," "oil on canvas," "pencil drawing," "charcoal drawing," "ink drawing," "matte painting," "fresco," "stone tablet," or "cave painting." This detail helps define the texture and material aspects of the image.
Specify the lighting techniques: Describe the lighting, such as "golden hour," "dappled light," "backlight," "overcast light," "direct sunlight," "moonlight," "starlight," "twilight," "harsh midday light," "chiaroscuro," "silhouette," "studio lighting," "neon lights," "cinematic lighting," "rim lighting," "volumetric lighting," or any combination that best represents the scene.
Identify the camera angle: Include the camera angle used in the image, such as "eye level shot," "low angle shot," "high angle shot," "aerial shot," "macro," "close-up," "wide angle," "overhead shot," "worm's eye view," "extreme close-up," "medium shot," "long shot," "point of view (POV) shot," or "over-the-shoulder shot." This provides a sense of perspective and how the viewer engages with the scene.
Use positive descriptions and avoid negatives: Clearly describe what is present, such as "blue hat," "half-person, half-robot," or "a psychedelic astronaut crew." Focus on positive and concrete elements rather than absent or negative aspects.
Specify details clearly: Be precise with counts and specific objects, for example, "three monkeys in business suits" or "a temple under a starry sky."
Use singular nouns or specific numbers: Be specific with counts, avoiding vague terms. Examples include "three cyberpunk wizards," "a psychedelic astronaut crew," or "a solarpunk city with holograms and futuristic glowing decorations."
Avoid significant extrapolation: Focus on what is directly visible without assuming hidden elements or extensive storytelling beyond what the image shows. Stick to what can be observed clearly.
Provide an example of the final description: For example, "A 3D digital art of three cyberpunk wizards standing on the shores of an infinite river under a sunset sky. The style combines elements of vaporwave and surrealism, executed in watercolor technique. The atmosphere conveys a sense of awe and a desire for knowledge, reminiscent of works by Salvador Dali and Greg Rutkowski. The lighting is warm, resembling the golden hour, and the camera angle is a low shot, adding drama and presence to the figures.
ТОП ИНСТРУМЕНТОВ: ГРАФИКА
ТОП ИНСТРУМЕНТОВ: АНИМАЦИЯ И ВИДЕО
Универсальная структура промпта для видеогенератора:
Объект (Описание объекта). Движение объекта. Сцена (Описание
Сцены и окружения). Ракурс и движение камеры. Освещение. Общая атмосфера и настроение
ТОП ИНСТРУМЕНТОВ: АУДИО
Как написать песню:
Попросить Claude Sonnet 3.5 или Claude Opus написать лирику/стихотворение на заданную тему и в заданном стиле. Вставить получившийся текст в Suno и описать стиль.
Спасибо �и удачи!
@aimastersme
канал о практике применения ИИ