Graduation Thesis

Graduation Thesis(畢業論文) Back

Graduation Thesis(畢業論文) Back

Title: Dynamical Content Injection of videos(視頻動態植入內容)

Tools: OpenCV, Ffmpeg, Imagemagick

Translation: works

Progress

For entities whose corners are easy to track like quadrangles, we're going to use LucasKanade Optical Flow Tracking to complete injection.
- the first thing to do is to use Feature Detection to detect strong corners in the first frame of the video.
- then we can find out four corners of those entities to choose
- after chosing 4 points, we will continue to track these points in the videos, and output positions of these 4 points in the following frames.
- with data of those points, we can present an other layer of image above the video with CSS in HTML.
For entities whose have not any corners, there are some problems to solute (we are going to detect the motion of the camera.)
- how to detect whether the image of the video has changed suddenly
  - Solution: use a feature point to track and monitoring the distance between the prev frame and the next frame. When the distance is more than a specific value.
- how to judge whether the camera is static.
  - Testing: tracking all the points and find out two points which has not moved too much, then we will consider it to be static.
- how to calculate the shift of x-axis and y-axis
- how to calculate the shift of z-axis
Reverse tracking for calculating the upper edge frame:
- there are some problems about OpenCV, because it cannot jump to a specific frame correctly, and tracking in reverse order should use VideoCapture::set(CV_CAP_PROP_POS_FRAMES, n) to reset.
Face replacement
- real time problem
- image size problem

Problems

the func VideoCapture::set(CV_CAP_PROP_POS_FRAMES, n) will lose a little precision in different systems, due to its computions of float type.
- Solution: use while to read from the start, but it will cause severe problem of efficiency.

how to generate rebuild video with an overlay image.

Solution:

i. Calc data with video and tracking points(video_width, video_height, 4 points, area_width, area_height)

ii. Contract all the frames with ffmpeg

ffmpeg -i input.mp4 -r "[video-rate]" "frame/f_%1d.png"

iii. Start to combine two image

Get the image size with Imagemagick (logo_width, logo_height):

identify -format "{\"width\": %[fx:w], \"height\": %[fx:h]}" overlay.png
# or 
identify -format "{\"width\": %w, \"height\": %h}" overlay.png

Add padding to an image with Imagemagick:
- area_width > area_height:
  - overlay_pad_width = logo_height 2 (area_width / area_height)
  - overlay_pad_height = logo_height * 2
- area_width <= area_height:
  - overlay_pad_width = logo_width * 2
  - overlay_pad_height = logo_width 2 (area_height / area_width)
```
convert "overlay.png" -background transparent -gravity center -extent "[overlay_pad_width]x[overlay_pad_height]" overlay_pad.png
```

Change the image's size to the size of video and change perspective and it depends:

the image is smaller than the video:

add padding:

convert "overlay_pad.png" -background transparent -extent "[video_width]x[video_height]" mask.png

perspective:

convert "mask.png" -matte -mattecolor transparent -virtual-pixel transparent\
-distort Perspective\
'0,0 point[0].x,point[0].y\
0,overlay_pad_height point[1].x,point[1].y\
overlay_pad_width,0 point[2].x,point[2].y\
overlay_pad_width,overlay_pad_height point[3].x,point[3].y' output.png

the image is larger than the video:

crop without changing the ratio:

convert "overlay_pad.png" -resize "[video_width]x[video_height]" resize.png
convert "resize.png" -background transparent -extent "[video_width]x[video_height]" mask.png

get size of resize.png (resize_width, resize_height)

identify -format "{\"width\": %[fx:w], \"height\": %[fx:h]}" resize.png
# or 
identify -format "{\"width\": %w, \"height\": %h}" resize.png

perspective:

convert "mask.png" -matte -mattecolor transparent -virtual-pixel transparent\
-distort Perspective\
'0,0 point[0].x,point[0].y\
0,resize_height point[1].x,point[1].y\
resize_width,0 point[2].x,point[2].y\
resize_width,resize_height point[3].x,point[3].y' output.png

put text to image

Solution:

convert \
     -size 165x70 \
  xc:lightblue \
  -font Bookman-DemiItalic \
  -pointsize 12 \
  -fill blue \
  -gravity center \
  -draw "text 0,0 '$(cat file.txt)'" \
  image.png

handle real time live stream
- Solution:
  1. read camera from a network stream
```
VideoCapture cam;
```

Performance

Handle Frames

video size	video length	handled frames number	result time
1920x1080	79s	47	10564ms
1920x1080	79s	394	1587437ms

Empty Comments

As the plugin is integrated with a code management system like GitLab or GitHub, you may have to auth with your account before leaving comments around this article.

Notice: This plugin has used Cookie to store your token with an expiration.

Aleen^®

More than a coder, more than a designer

modified at 2016-04-29 17:26:01

20 issues reported

#35 [思] Handlebars 模板应该如何进行预处理2019-01-31 13:42:05Inspiration!

#32 [集] A collection of confusing problems met when developing JavaScript under IE2022-01-11 14:03:30Tasks!

#30 [思] 当需要传递多个不定参数时，该如何设计 JavaScript 函数？2017-05-29 09:42:07Inspiration!

#29 [歸納] 304 Status Code 下的头部信息2018-03-19 12:25:21Communication!Summary!

#28 [歸納] 你以为 JavaScript 没数据结构么？2018-03-19 12:25:04Communication!Summary!

#27 [歸納] JavaScript 之高性能2019-09-20 13:56:28Communication!Summary!

#26 Microtasks? Macrotasks?2017-02-26 18:05:36Communication!