The easiest online tool to make videos.  Work in teams or work solo.  Make a video for every moment, every content, everything!

Video Creation: The Must-Have ‘Build vs Buy’ Guide for Developers

This guide will help you make the right decision when you find yourself asking, ‘Should I build or buy a video creation app for my users?’

First, I will take you through the importance of video marketing, help you understand the different elements of a video, and then walk you through some steps involved in creating your own video maker from scratch. By considering factors like time, resources, expenses, and the process involved, I will then explore all possible alternatives and help you come to a quick decision.

Importance of video marketing

This video explains how important video marketing is for a business, regardless of the industry you’re in!

So you know the importance of video marketing and have probably been hearing about how other businesses and your competitors are rapidly adopting it.

Text-based videos are becoming increasingly common on different social media networks like Facebook, Twitter, and Instagram. Businesses are rapidly adopting these kinds of videos over professional shoots and live action videos because they’re faster to create and easier to consume.

Understanding the different elements of a video

If you’re considering a solution to enable your internal team or your users to make videos on your website or app, you have to start by understanding what some of the components of a video are and what each element means.

These are the different components of a video: animations, captions, media, icons, and voiceover
  • Voiceover. Also known as video narration, a voiceover is an audio file that plays over the video. The duration of the voiceover is controllable and it can either last for one particular scene of the video, or for the entire length of the video.

  • Captions. These are text elements that can be placed anywhere on the video. The position and duration of a subtitle are controllable. You can edit the language of the text, customize its color, highlight some keywords, and adjust the size of each subtitle.

    Video subtitles are pretty useful because most of the times, on social media, videos are played on mute! Did you know that 85% of videos on Facebook are played without sound? (source)

  • Animations or motion graphics. This is NOT to be confused with animated videos. By animations, I refer to the different types of overlays and scene transitions that can be included in every scene of the video.

  • Media file. You can have a video with one image/video clip in each scene, or a collage of multiple images/video clips playing in a scene. Sometimes, you can have scenes without any image/video clip as well.

Equipped with this knowledge, we can now explore the possible ways you can enable your internal team or your users to make videos on your website or app.

Building your own video maker from scratch

Let us assume that you have decided to build your own video creating tool from scratch. Here are some steps you will go through:

  1. Researching and prototyping
  2. Building an MVP
  3. Beta testing and bug/UX fixes
  4. Alpha release
  5. User feedback gathering and iterating

Researching and prototyping (2-3 weeks)

Instead of directly jumping into coding, you should start off by gathering information and conducting research. Get started by curating a list of features, the key deliverables, how the output should be represented and processed, and plan the code architecture. This will ideally take you and your team about 2-3 weeks.


Number of people2 (1 designer + 1 developer)
Time2-3 weeks
Expenses (salaries and compensations)$1000

To help you get started, here are some frequently asked questions (FAQs) on ffmpeg and aerender, two terms you will come across multiple times. Note that FFmpeg and Aerender are useful for backend video rendering.

• What is FFmpeg?

FFmpeg is an open-source project which consists of a huge repository of libraries and programs to decode, encode, transcode, mux, demux, stream, filter and play video, audio, and other multimedia files. Written in C, FFmpeg is very comprehensive in the number of operations you can do with it. You can run FFmpeg commands from your terminal in Microsoft Windows, Linux, or Mac OS.

• What is Aerender?

Aerender.exe is a program made by Adobe. Since Aerender is a command-line application, you can use it to perform rendering operations from your terminal.

• How do I pick between FFmpeg and aerender?

For very basic videos with minimalistic transitions (like the preset animations in Microsoft Powerpoint), FFmpeg is the perfect choice. You can perform operations like adding text or voiceover to videos. However, if your video has animated overlays and motion graphics that require heavy rendering, you should consider aerender.

One big question you need to get an answer to during this researching phase is whether you want to build a UI or not. Video making can be enabled without a UI – you can build scripts and write commands to fire them when needed. Or you can build a basic UI that looks something like this:

This is Rocketium’s existing video editor – an example of how the UI of your video editor can look like.

Depending on the need, you can take a call on whether a UI is required or not. Having a UI will give more customization options to your user, and they will be able to see a preview of how the video will look like before publishing it.

Once you’ve researched and built a prototype, it is time to start what you are the best – coding. Get those coffee mugs and Mountain Dew cans ready cause the next few months are going to be pretty long!

Building an MVP (6-12 weeks)

You need to build just enough features and functionality of your minimum viable product to get the first round of feedback from early adopters and testers. Depending on your requirements, this step can take anywhere between 6 to 12 weeks.

Releasing a basic version of the video editor

A basic version of your video maker will allow your internal team and your users to make videos with minimalistic animations and transitions, no motion graphics, no voiceover, and no customization options.

Number of people3 (2 designers + 1 developer)
Time6-12 weeks
Expenses (salaries, licenses, libraries)$2000

Here is a checklist of features you will need to add:

  • Font control. Your users should be able to select fonts from a given list. You will need to get commercial licenses for fonts that you wish to include in your video maker. Here is a collection of some of the best fonts for video.

  • Color control. You need to allow your users to select their own color palette for fonts or the scene background. You can also add some preset color palettes to save video creation time. Here is an article on how to select preset color palettes for videos.

  • Media control. Your users should be able to upload their own images/videos or import them from cloud storage (like Dropbox, Google Drive, or S3). As an additional feature, you can also build a media library with royalty-free stock images and video footage. Note that you will need to purchase commercial licenses from content houses like Shutterstock, Getty, or Pixabay.

  • Minimalistic animations and transitions. You should enable your users to choose some basic text, scene, and media animations. Some animations that do not need heavy rendering are the ones you would’ve come across in Microsoft Powerpoint and other PowerPoint alternatives. Here are some examples of basic and advanced text animations:
Some examples of basic and advanced text animations
  • Theme and template selection. Since there are limited animations and transitions, your users should have access to some built-in themes and templates. This will require your design team to build preset templates which include text fonts, background music, color palette, and some animations and transitions.

  • Background music selection. Though videos are, by default, played without sound on social media, videos on your website or YouTube need soundtracks. Your users should be able to browse a collection of soundtracks or upload their own. Note that you will need to purchase commercial licenses from music providers. Here is a list to get started.

To grant more control over the video components to your users, you need to build some additional functionalities, which extends your build time and the resources you’re investing. This also means creating more preset animations, themes, and templates – which increases the design turnaround time and expenses.

Releasing an advanced version of the video editor

An advanced video editor gives more customizing options to your users. You can allow them to position their text captions and media anywhere on the video canvas, bring in multiple images or video clips per scene, and give control over timing and voiceovers.

Number of people5 (3 developers, 2 designers)
Time10-12 weeks
Expenses (salaries, licenses, libraries)$3500

Here is a checklist of the extra functionality and features you will need to add:

  • Logo support. An important aspect of video marketing is branding. Your own internal team would want to be able to add your logo and place it anywhere on the scene, or even hide it in certain scenes. Building this requires you to go back and revisit your code architecture.

  • Position control. Your users should be able to position the text captions and media files anywhere on the scene, either by dragging them or using the arrow keys on the keyboard. Read how we built our own interactive video editor here.

  • Timing control. You need to enable your users to have control over the timings of individual captions. They should be able to set their own delays and caption durations using timestamps of the video.

  • Advanced animations and transitions. By building this, your users will be able to choose animations, transitions, and motion graphics that need heavy rendering. You will need to consider the design time and resources needed to create these preset animations for your users.

  • PIP (picture-in-picture). Sometimes your users would want to be able to add multiple images/video clips in one scene and control their individual positions. Building this would require you to revisit your prototype and need some changes to the code architecture.

  • Voiceover support. Your users should be able to upload or record audio files as narration for the entire video or individual scenes. A major challenge here is to enable syncing of the audio file with the video and allowing users to adjust it according to their needs.

  • Video preview. Building a preview of the video currently being made is the biggest piece of your video making tool. This requires you to brush up your knowledge of ReactJS and HTML.

Clearly building your own video maker requires a lot of time and resources, and is quite expensive. But the list of limitations does not stop here.

Limitations of building your own video maker

Even after you’ve built a working solution, you will need a dedicated team of designers to keep releasing new templates, themes, and animations as and when the need arises. To release multiple variations, templates, and more features, you need to repeat the entire process above and involve the same number of people. The next step could be to hire a dedicated mobile app development team to help adapt your design for mobile apps.

Another limitation for this is cost – not only the build cost but the maintenance. Video rendering is quite expensive. Amazon’s video transcoding service starts at $0.0075/minute. To understand this better, imagine you own a real estate platform. Your team needs to make video property ads of over 10,000 properties listed on your website. Each ad copy needs 2 variations for A/B testing, and each ad copy needs to be made in 3 different languages. Moreover, each video ad is 2 mins long. Your total spend, just for video rendering, now becomes $900! This is excluding the cost of servers you will need for storing, computing, rendering, and other devops related processes like monitoring and error logging.

Your internal team will need new sets of animations and themes for each campaign they run. They will also need to re-create existing videos in different languages to cater to a global audience.

And this takes us to the biggest limitation of building your own solution – scale. If your team or your users need to make 1-2 videos a month, this solution works, but it does not justify the cost and time taken to build it. If the need increases more than 20 videos per month, this solution fails because you need to repeat the same process for each new template or style variation. So, now the question arises – what alternates do I have?

Alternatives to building your own video maker

What you need is an existing solution that can be plugged into your website or app. A solution that enables your own team or your users to make videos.

This can be achieved in two ways – you hire an agency to help you create videos at scale, or you use existing libraries and video creation APIs.

Alternative #1: Hire an agency and outsource video creation

An agency charges anywhere between $6,000 to $10,000 to create one high-quality video. Added to this, the turnaround time in most cases is very high because the process becomes too long and involves multiple iterations.

You need a permanent, self-sustaining solution that your internal team or your users can rely on to create multiple videos with different style variations, themes, and languages. And all of this becomes quite difficult to achieve with the help of an agency.

If you wish to enable your users to create videos, then a video agency cannot contribute much to it. The only work you can outsource to them is the creation of preset themes and templates, and some advanced animation and transition styles. You will still have to spend time and resources into building the solution end-to-end.

Alternative #2: Use existing APIs and libraries

The primary advantage of using existing APIs and libraries is that you can implement your go-to-market strategy much faster than when you build your own solution. This helps you stay several steps ahead of your competitors. make close to 100 videos a month using Rocketium's video API

Rocketium’s suite of video APIs enables you to run video automation processes on your website. With these APIs, you can explore multiple ways to implement video creation – some that you may not have thought of earlier!

Code integration

FactorNumber? Metrics? Impact?
Number of people1 developer
Time2-3 weeks
Expenses (API cost, salaries)< $1000

You can build your own custom UI and Rocketium’s video API will handle all the backend processes, including storage, computing, processing, and rendering on its own.

Example of how Rocketium’s video API can be used to create a video making experience on your website

To get started, you can refer to the API docs and play around in the sandbox.

Through Rocketium APIs, you can allow your users to perform advanced video operations like trimming, language translation, text to speech, voiceover addition, subtitles, and direct sharing to social media.

Assuming you do not need a dedicated UI, there are two more solutions you can offer to your users:

Form to video

A form to video solution is ideal for when you want to run UGC (user-generated content) campaigns, where you allow your users to make videos by simply filling in a web-based form.

Example of how a form to video solution can be implemented using Rocketium’s video APIs

You can either build this form yourself, or you can run preset scripts on Google Forms that make calls to Rocketium’s video creation APIs whenever the form is submitted. You can then make calls to APIs that fetch the video once it is generated, and store it in a Google Sheet along with the responses, or store it in your own database.

Google Sheets to video

If you don’t want to build your own forms, you can enable your users to make videos from a spreadsheet. The process for your user is simple – each row represents a video, and they can add the content and media files by inserting text and links into the columns of the Google Sheet.

Example of how a Google Sheet can be used to make videos automatically using Rocketium’s API

Key takeaways

Sometimes you have to ask yourself, “Do we really need to reinvent the wheel? Or can we get a wheel from somewhere, and focus on building innovative things around it?”

Building your own video-making tool is not only very time consuming (roughly 8-15 weeks), but the amount of resources you will put into it does not justify the results (approximately $6000 and 5 people involved working full-time). Outsourcing to an agency is also not a viable option due to a high turnaround time that involves multiple iterations.

The only option that you are left with is to use existing solutions, like APIs and libraries, to speed up your go-to-market plan. This reduces development work, allows you to scale indefinitely, saves designing time, and lets you create videos in real-time. This can be implemented in approximately 3-4 weeks and needs only 1 developer’s time!

What do you think? Got any questions? Feel free to drop a comment below! 👇

Pulkit Jain

Pulkit Jain

Engineer • Marketer • Tech Geek • Movie & Comic book nerd • Avid reader • Gaming Addict • Football fanatic #GGMU
  • 0


Leave a Reply

Your email address will not be published.

Get Your Monthly CreativeOps Recap

Everything you need to know about CreativeOps for marketing and design teams, brought to you in minutes.

Thank you for subscribing.