{"id":594,"date":"2025-12-08T15:27:43","date_gmt":"2025-12-08T14:27:43","guid":{"rendered":"https:\/\/blog.oldwisebear.com\/?p=594"},"modified":"2025-12-08T15:27:43","modified_gmt":"2025-12-08T14:27:43","slug":"the-ai-journey-video-creation","status":"publish","type":"post","link":"https:\/\/blog.oldwisebear.com\/index.php\/2025\/12\/08\/the-ai-journey-video-creation\/","title":{"rendered":"The AI Journey &#8211; video creation"},"content":{"rendered":"\n<figure class=\"wp-block-image aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1913\" height=\"964\" src=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/06\/Penguin_pergamin.jpg\" alt=\"\" class=\"wp-image-559\" style=\"width:681px;height:auto\" srcset=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/06\/Penguin_pergamin.jpg 1913w, https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/06\/Penguin_pergamin-768x387.jpg 768w, https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/06\/Penguin_pergamin-1536x774.jpg 1536w\" sizes=\"auto, (max-width: 1913px) 100vw, 1913px\" \/><\/figure>\n\n\n\n<div class=\"wp-block-group is-nowrap is-layout-flex wp-container-core-group-is-layout-7387b849 wp-block-group-is-layout-flex\">\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"324\" height=\"275\" src=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/06\/Penguin_smile.jpg\" alt=\"\" class=\"wp-image-561\" \/><\/figure>\n\n\n\n<div class=\"wp-block-group is-layout-constrained wp-block-group-is-layout-constrained\">\n<p class=\"wp-block-paragraph\">The other topic that intrigued me, as mentioned in one of the AI posts, was video creation.<\/p>\n\n\n\n<pre id=\"block-76c80ec1-aeac-435d-a73a-081c59ced0e2\" class=\"wp-block-preformatted\">Can I finally become a movie director without special skills or a multimillion-dollar budget?<\/pre>\n<\/div>\n<\/div>\n\n\n\n<h2 class=\"wp-block-heading\">The Tools<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">I decided to start this journey around the time Veo3 was announced. Along with it came a huge set of promotional videos that created a global \u201cwow\u201d effect and immediately caught my attention. Unfortunately, my country wasn\u2019t on the initial release list, so I had to experiment with Veo2 and other models for almost a month. During that time, I managed to create two videos, which you can watch here:<br><a href=\"https:\/\/www.youtube.com\/watch?v=QcAanI0b5aU\">https:\/\/www.youtube.com\/watch?v=QcAanI0b5aU<\/a><br><a href=\"https:\/\/www.youtube.com\/watch?v=cuwr2zRCYoE\">https:\/\/www.youtube.com\/watch?v=cuwr2zRCYoE<\/a><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Long story short, most attempts were a miss. After many days, I finally completed a video, but the process was painfully slow due to all the limitations.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Current day status<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">After that, I drifted in and out of video generation due to time constraints. Still, I kept an eye on every new release, waiting for the breakthrough that would truly click for me. In the meantime, Veo 3.1 was launched, and since nothing else stood out, I\u2019ve continued using the \u201cFast\u201d version because the \u201cQuality\u201d mode isn\u2019t worth the increased credit cost, at least for my needs.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Recently, I set myself a new challenge. Cars have always been one of my passions, so I decided to create a promotional video. Watching Bugatti ads or Subaru\u2019s Impreza WRX commercials always made my heart skip a beat and sparked the desire to drive one. With that inspiration, I challenged myself to make a promo video for a hypothetical new Nissan GT-R. First, I designed a concept by reusing the current model and adding some tweaks. Needless to say, this was a lengthy process, before I got the result I wanted, I had to cut and paste various parts and ask Gemini to stitch them together.<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/car1.png\" alt=\"\" class=\"wp-image-633\" style=\"width:598px;height:auto\" srcset=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/car1.png 1024w, https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/car1-768x768.png 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">In the end, it turned out quite decent (though I couldn\u2019t get it to create a straight LED strip under the hood aligned with the headlights).<\/p>\n\n\n\n<div class=\"wp-block-group is-nowrap is-layout-flex wp-container-core-group-is-layout-7387b849 wp-block-group-is-layout-flex\">\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"512\" height=\"512\" src=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/image.png\" alt=\"\" class=\"wp-image-630\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"512\" height=\"512\" src=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/image-2.png\" alt=\"\" class=\"wp-image-632\" \/><\/figure>\n<\/div>\n\n\n\n<p class=\"wp-block-paragraph\">With the starting image ready, I generated a set of images of the car in various scenarios.<\/p>\n\n\n\n<div class=\"wp-block-group is-layout-grid wp-container-core-group-is-layout-6c3a3c5a wp-block-group-is-layout-grid\">\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/car4.png\" alt=\"\" class=\"wp-image-634\" srcset=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/car4.png 1024w, https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/car4-768x768.png 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/Gemini_Generated_Image_bp70h4bp70h4bp70.png\" alt=\"\" class=\"wp-image-635\" srcset=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/Gemini_Generated_Image_bp70h4bp70h4bp70.png 1024w, https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/Gemini_Generated_Image_bp70h4bp70h4bp70-768x768.png 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-image size-full wp-container-content-b5090af8\"><img loading=\"lazy\" decoding=\"async\" width=\"1056\" height=\"992\" src=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/car3-1.png\" alt=\"\" class=\"wp-image-637\" srcset=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/car3-1.png 1056w, https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/car3-1-768x721.png 768w\" sizes=\"auto, (max-width: 1056px) 100vw, 1056px\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/image-3.png\" alt=\"\" class=\"wp-image-639\" srcset=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/image-3.png 1024w, https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/image-3-768x768.png 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n<\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Next came the video creation. I worked in both Gemini Chat and Flow. While Gemini could only use the reference picture to create a scene, Flow allowed me to add transitions between points A and B. However, I ran into several issues, so let\u2019s go through some of them.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">The problems<\/h2>\n\n\n\n<div class=\"wp-block-group is-nowrap is-layout-flex wp-container-core-group-is-layout-7387b849 wp-block-group-is-layout-flex\">\n<figure class=\"wp-block-image aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/image-4.png\" alt=\"\" class=\"wp-image-640\" style=\"width:506px;height:auto\" srcset=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/image-4.png 1024w, https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/image-4-768x768.png 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<div class=\"wp-block-group wp-container-content-9e2f13cb is-vertical is-layout-flex wp-container-core-group-is-layout-2c90304e wp-block-group-is-layout-flex\">\n<p class=\"wp-container-content-9e2f13cb wp-block-paragraph\">The first problem I encountered was related to what Google Veo allows. I wanted to start my video with a dramatic shot of a storm approaching the coast of Japan, with a boy pointing at it, clearly excited. However, no matter how I described the scene, I was always greeted with the same message:<\/p>\n\n\n\n<pre class=\"wp-block-verse\"><em>I can't generate that video. Try describing another idea. You can also get tips for how to write prompts and review our video policy guidelines. <a href=\"https:\/\/gemini.google\/policy-guidelines\/\" target=\"_blank\" rel=\"noreferrer noopener\">Learn mor<\/a><\/em><\/pre>\n\n\n\n<p class=\"wp-block-paragraph\">When I asked Gemini about the issue, the explanation was:<br><\/p>\n\n\n\n<pre class=\"wp-block-verse\"><em>To answer your question regarding the guidelines: <strong>This specific image and prompt combination likely triggers \"Child Safety\" precautions.<\/strong><\/em><\/pre>\n\n\n\n<p class=\"wp-block-paragraph\">While I understand the reasons behind these safeguards, my prompt didn\u2019t imply any violence or wrongdoing. Even when I tried to follow Gemini\u2019s suggestion to describe the boy as a young adult, the system still refused to generate the video. In the end I removed the boy from the clip and moved on.<\/p>\n<\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-group is-nowrap is-layout-flex wp-container-core-group-is-layout-7387b849 wp-block-group-is-layout-flex\">\n<div class=\"wp-block-group wp-container-content-9e2f13cb is-vertical is-layout-flex wp-container-core-group-is-layout-2c90304e wp-block-group-is-layout-flex\">\n<p class=\"wp-container-content-9e2f13cb wp-block-paragraph\">The second problem involved elements that required consistent and logical changes throughout the shot specifically numbers and text. No matter how I described the speedometer animation, it didn\u2019t work. I tried prompts like <em>\u201cshow it go from 0 to 150 as if it were a sports car\u201d<\/em> and <em>\u201cshow the car gaining speed with each second; the shot should display these values on the digital meter: 0, 10, 25, 40, 60, 80, 100.\u201d<\/em> In the end, I had to stitch together several images with a clip to simulate the desired effect, but it was far from perfect. Animating text also posed challenges, though those were solvable after a few attempts.<\/p>\n<\/div>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"480\" height=\"270\" src=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/The_camera_is_202512070025.gif\" alt=\"\" class=\"wp-image-641\" style=\"width:506px;height:auto\" \/><\/figure>\n<\/div>\n\n\n\n<div class=\"wp-block-group is-nowrap is-layout-flex wp-container-core-group-is-layout-7387b849 wp-block-group-is-layout-flex\">\n<figure class=\"wp-block-image aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"480\" height=\"270\" src=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/The_camera_slowly_202512080104.gif\" alt=\"\" class=\"wp-image-642\" style=\"width:506px;height:auto\" \/><\/figure>\n\n\n\n<div class=\"wp-block-group wp-container-content-9e2f13cb is-vertical is-layout-flex wp-container-core-group-is-layout-2c90304e wp-block-group-is-layout-flex\">\n<p class=\"wp-container-content-9e2f13cb wp-block-paragraph\">The third issue was the less-than-ideal scene transitions when creating A\u2192B clips. I provided the starting and ending images along with this prompt:<\/p>\n\n\n\n<pre class=\"wp-block-verse wp-container-content-9e2f13cb\">The camera slowly move backwards showing more of the car which is in a move. After a moment the scene changes suddenly to a small Japanese village with a spiritual feeling. After another moment the scene changes again this time showing the car in front of the Japanese temple. The only sounds in the background, through the whole video should be the Japanese cicadas<\/pre>\n\n\n\n<p class=\"wp-block-paragraph\">The result, as you can see, was far from perfect especially the poorly drawn cicadas, which were meant to be only an audio element, not a visual one.<\/p>\n<\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-group is-nowrap is-layout-flex wp-container-core-group-is-layout-7387b849 wp-block-group-is-layout-flex\">\n<div class=\"wp-block-group wp-container-content-9e2f13cb is-vertical is-layout-flex wp-container-core-group-is-layout-2c90304e wp-block-group-is-layout-flex\">\n<p class=\"wp-container-content-9e2f13cb wp-block-paragraph\">The last and most obvious issue was the model\u2019s tendency to produce errors caused by overly short or overly detailed prompts and its own \u201cimagination.\u201d I must admit, a moonwalking Godzilla is quite a sight, but that wasn\u2019t the race I had in mind. Similarly, when I asked for a car driving in a Japanese harbor, the model not only added a new car instead of animating the existing one, but also placed it in the middle of the sea. I did mention a raging storm in the prompt, but wasn&#8217;t expecting that.<br>How did I fix these? Sometimes rephrasing the prompt with simpler or alternative wording helped. But the most effective solution was breaking the scene into several smaller, straightforward shots. That approach finally gave me the results I wanted.<\/p>\n<\/div>\n\n\n\n<div class=\"wp-block-group is-vertical is-layout-flex wp-container-core-group-is-layout-2c90304e wp-block-group-is-layout-flex\">\n<figure class=\"wp-block-image aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"480\" height=\"270\" src=\"https:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/The_car_is_202512080103.gif\" alt=\"\" class=\"wp-image-643\" style=\"width:506px;height:auto\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"480\" height=\"270\" src=\"http:\/\/blog.oldwisebear.com\/wp-content\/uploads\/2025\/12\/Keep_the_top_202512070020.gif\" alt=\"\" class=\"wp-image-644\" \/><\/figure>\n<\/div>\n<\/div>\n\n\n\n<h2 class=\"wp-block-heading\">Conclusion<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">While it\u2019s not quite there yet for me, with enough time and solid editing skills, these tools can already be powerful, especially when combined with image generation from Gemini\u2019s Nano Banana Pro model or Midjourney as a base for videos (or for creating A\u2192B clips).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Few takes from all of this:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Various models deliver similar results, but in my opinion, Veo 3.1 is the most consistent and produces the best output.<\/strong><br>That said, it\u2019s still far from ideal, and the number of attempts you get with the Pro plan is laughable if you\u2019re trying to create something serious.<\/li>\n\n\n\n<li><strong>Write down the key points of your video, but keep prompts simple, the simpler the prompt, the better the result.<\/strong><br>Combine AI output with video editing skills to achieve your vision faster instead of wasting credits on overly complex prompts.<\/li>\n\n\n\n<li><strong>Add voices or music in post-production, keep video prompts focused on simple sounds. <\/strong><br>This makes it easier to merge clips and maintain consistent audio\/voices for storytelling.<\/li>\n\n\n\n<li><strong>Make sure you have a lot of patience too&#8230;<\/strong><br>&#8230;especially when trying to nail the right angle.<\/li>\n\n\n\n<li><strong>And possibly If you have the resources, consider training your own model.<\/strong><br>Current guidelines in Gemini and similar tools can feel arbitrary, and there\u2019s no clear way to dispute guideline violations.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">And with that, here\u2019s the final creation for my latest idea:<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"New Nissan GT-R 2026 Promo Video (Veo 3.1 AI)\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/cFzelm4242k?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n","protected":false},"excerpt":{"rendered":"<p>The other topic that intrigued me, as mentioned in one of the AI posts, was video creation. Can I finally become a movie director without special skills or a multimillion-dollar budget? The Tools I decided to start this journey around the time Veo3 was announced. Along with it came a huge set of promotional videos [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","ast-disable-related-posts":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"set","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[35,30,16],"tags":[36,22],"class_list":["post-594","post","type-post","status-publish","format-standard","hentry","category-ai","category-challenge","category-hobby","tag-ai","tag-penguin"],"_links":{"self":[{"href":"https:\/\/blog.oldwisebear.com\/index.php\/wp-json\/wp\/v2\/posts\/594","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.oldwisebear.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.oldwisebear.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.oldwisebear.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.oldwisebear.com\/index.php\/wp-json\/wp\/v2\/comments?post=594"}],"version-history":[{"count":5,"href":"https:\/\/blog.oldwisebear.com\/index.php\/wp-json\/wp\/v2\/posts\/594\/revisions"}],"predecessor-version":[{"id":646,"href":"https:\/\/blog.oldwisebear.com\/index.php\/wp-json\/wp\/v2\/posts\/594\/revisions\/646"}],"wp:attachment":[{"href":"https:\/\/blog.oldwisebear.com\/index.php\/wp-json\/wp\/v2\/media?parent=594"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.oldwisebear.com\/index.php\/wp-json\/wp\/v2\/categories?post=594"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.oldwisebear.com\/index.php\/wp-json\/wp\/v2\/tags?post=594"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}