Mozilla didn't have a choice. VP9/WebM is not happening. The primary reason is that almost all inter-frame content is acquired or transmitted as H.264: mobile phones, prosumer cameras, network-based encoding appliances, PVRs, cable/satellite TV. No one wants to re-encode. Plus there is a lot of baked-in hardware support for H.264 encoding/decoding.
Exactly. Does anyone know of any cheap solutions for capturing in VP9 on devices such as cell phones?
I'm guessing this is Cisco's motive as well -- they want to sell video conferencing and IP video/voice-chat stuff -- and that'll have to work with cellphones and tablets. As long as almost all devices (including PCs via video hardware) have hardware support for H.264 -- and no support for anything else -- we'll be stuck on H.264 for "cross platform" video.