My PhD (10+ years ago) was in video search, and one of the proposed methods for video comparison used the shot durations of the video. By shots I mean cuts in the camera flow; IIRC Hollywood movies have such a shot cut every 4 seconds on average (for example, when two people talk the camera moves from one person to the other).
I remember that I used two techniques for extracting scene cuts:
* Difference in the brightness (Y) histogram of the YUV video between frames; when that difference is more than a threshold there's a scene cut
* Counting the number of Intra macroblocks per frame on an H.264 encoded video; if that number was more than a threshold then there's a scene cut
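The first technique above can be sketched in a few lines. This is a minimal illustration with NumPy, not the original code; the bin count and threshold are illustrative values that would need tuning on real footage:

```python
import numpy as np

def luma_histogram_cut(prev_y, curr_y, threshold=0.3, bins=64):
    """Flag a scene cut when the luma (Y) histograms of two consecutive
    frames differ by more than `threshold`. `prev_y`/`curr_y` are 2-D
    uint8 luma planes; `threshold` and `bins` are illustrative, not
    tuned constants."""
    h1, _ = np.histogram(prev_y, bins=bins, range=(0, 256))
    h2, _ = np.histogram(curr_y, bins=bins, range=(0, 256))
    # Normalize so the distance is independent of frame size.
    h1 = h1 / h1.sum()
    h2 = h2 / h2.sum()
    # L1 distance between the normalized histograms (max 2.0).
    return np.abs(h1 - h2).sum() > threshold
```

In practice you would run this over consecutive decoded frames and possibly require the distance to stand out from a local average, so a gradual fade doesn't trip the threshold.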
Author of PySceneDetect here. The current implementation does exactly what you hint at, except instead of YUV it considers deltas in the HSV domain (specifically, differences in the hue, saturation, and value channels).
Other techniques being considered for future work include use of optical flow, background subtraction, and analyzing histograms.
From what I remember, the Y (luma) component of a YUV video carries more information than the other two components, and it can also be extracted without fully decompressing the video (in MPEG-compressed videos). Of course this info is more than 10 years old (I don't really do any video research any more), so I'd guess there has been progress in that area.
This is indeed correct; I'm just using HSV instead of YUV, but the primary source of information is still the luma/brightness component (although currently all 3 HSV components are averaged, so perhaps a better weighting would improve precision).
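The averaging idea described here can be sketched as follows. This is only an illustration of the concept with NumPy, not PySceneDetect's actual implementation:

```python
import numpy as np

def hsv_frame_delta(prev_hsv, curr_hsv):
    """Average absolute per-pixel change across the H, S and V
    channels -- the idea described above, not PySceneDetect's actual
    code. Frames are arrays of shape (height, width, 3) holding the
    HSV channels."""
    diff = np.abs(curr_hsv.astype(np.float64) - prev_hsv.astype(np.float64))
    # Mean over pixels per channel, then averaged equally across the
    # three channels (a non-uniform weighting here could favour V).
    return diff.reshape(-1, 3).mean(axis=0).mean()
```

Replacing the final equal-weight mean with a weighted one (e.g. weighting V higher) is exactly the kind of tweak mentioned above.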
The image of an MPEG-compressed video is split into 16x16 blocks. For each frame (excluding the first, of course), the compression algorithm tries to match each block with a block in the previous frame (it searches the previous frame for the position with the fewest differences). If it succeeds, it only encodes the differences and the position in the previous frame; this is called an inter (predicted) block. If however it can't match that block with the previous frame, it needs to re-encode it from scratch; that's an intra macroblock. As you'd expect, right after a shot cut there are many more intra macroblocks.
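A toy version of that inter/intra decision can be written directly on pixel data. This sketch (NumPy, exhaustive search, made-up SAD threshold) is far simpler than a real encoder's motion estimation, but it shows why a shot cut makes the intra count spike:

```python
import numpy as np

def count_intra_blocks(prev, curr, block=16, search=8, sad_limit=2000):
    """Toy inter/intra decision: for each 16x16 block of `curr`,
    search a +/-`search` pixel window in `prev` for the best match
    (lowest sum of absolute differences, SAD). Blocks whose best SAD
    still exceeds `sad_limit` count as intra. All thresholds are
    illustrative, not taken from any real codec."""
    h, w = curr.shape
    intra = 0
    for by in range(0, h - block + 1, block):
        for bx in range(0, w - block + 1, block):
            target = curr[by:by + block, bx:bx + block].astype(np.int32)
            best = None
            for dy in range(-search, search + 1):
                for dx in range(-search, search + 1):
                    y, x = by + dy, bx + dx
                    if 0 <= y <= h - block and 0 <= x <= w - block:
                        cand = prev[y:y + block, x:x + block].astype(np.int32)
                        sad = np.abs(target - cand).sum()
                        if best is None or sad < best:
                            best = sad
            if best is None or best > sad_limit:
                intra += 1
    return intra
```

Across a shot cut nearly every block fails to find a match, so the per-frame intra count jumps; thresholding that count is the second detection technique described above.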