CHANGELOG 27 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516517518519520521522523524525526527528529530531532533534535536537538539540541542543544545546547548549550551552553554555556557558559560561562563564565566567568569570571572573574575576577578579580581582583584585586587588589590591592593594595596597598599600601602603604605606607608609610611612613614615616617618619620621622623624625626627628629630631632633634635636637638639640641642643644645646647648649650651652653654655656657658659660661662663664665666667668669670
  1. 2017-01-09 v1.6.1 "Long Tailed Duck"
  2. This release improves upon the VP9 encoder and speeds up the encoding and
  3. decoding processes.
  4. - Upgrading:
  5. This release is ABI compatible with 1.6.0.
  6. - Enhancements:
  7. Faster VP9 encoding and decoding.
  8. High bit depth builds now provide similar speed for 8 bit encode and decode
  9. for x86 targets. Other platforms and higher bit depth improvements are in
  10. progress.
  11. - Bug Fixes:
  12. A variety of fuzzing issues.
  13. 2016-07-20 v1.6.0 "Khaki Campbell Duck"
  14. This release improves upon the VP9 encoder and speeds up the encoding and
  15. decoding processes.
  16. - Upgrading:
  17. This release is ABI incompatible with 1.5.0 due to a new 'color_range' enum
  18. in vpx_image and some minor changes to the VP8_COMP structure.
  19. The default key frame interval for VP9 has changed from 128 to 9999.
  20. - Enhancement:
  21. A core focus has been performance for low end Intel processors. SSSE3
  22. instructions such as 'pshufb' have been avoided and instructions have been
  23. reordered to better accommodate the more constrained pipelines.
  24. As a result, devices based on Celeron processors have seen substantial
  25. decoding improvements. From Indian Runner Duck to Javan Whistling Duck,
  26. decoding speed improved between 10 and 30%. Between Javan Whistling Duck
  27. and Khaki Campbell Duck, it improved another 10 to 15%.
  28. While Celeron benefited most, Core-i5 also improved 5% and 10% between the
  29. respective releases.
  30. Realtime performance for WebRTC for both speed and quality has received a
  31. lot of attention.
  32. - Bug Fixes:
  33. A number of fuzzing issues, found variously by Mozilla, Chromium and others,
  34. have been fixed and we strongly recommend updating.
  35. 2015-11-09 v1.5.0 "Javan Whistling Duck"
  36. This release improves upon the VP9 encoder and speeds up the encoding and
  37. decoding processes.
  38. - Upgrading:
  39. This release is ABI incompatible with 1.4.0. It drops deprecated VP8
  40. controls and adds a variety of VP9 controls for testing.
  41. The vpxenc utility now prefers VP9 by default.
  42. - Enhancements:
  43. Faster VP9 encoding and decoding
  44. Smaller library size by combining functions used by VP8 and VP9
  45. - Bug Fixes:
  46. A variety of fuzzing issues
  47. 2015-04-03 v1.4.0 "Indian Runner Duck"
  48. This release includes significant improvements to the VP9 codec.
  49. - Upgrading:
  50. This release is ABI incompatible with 1.3.0. It drops the compatibility
  51. layer, requiring VPX_IMG_FMT_* instead of IMG_FMT_*, and adds several codec
  52. controls for VP9.
  53. - Enhancements:
  54. Faster VP9 encoding and decoding
  55. Multithreaded VP9 decoding (tile and frame-based)
  56. Multithreaded VP9 encoding - on by default
  57. YUV 4:2:2 and 4:4:4 support in VP9
  58. 10 and 12bit support in VP9
  59. 64bit ARM support by replacing ARM assembly with intrinsics
  60. - Bug Fixes:
  61. Fixes a VP9 bitstream issue in Profile 1. This only affected non-YUV 4:2:0
  62. files.
  63. - Known Issues:
  64. Frame Parallel decoding fails for segmented and non-420 files.
  65. 2013-11-15 v1.3.0 "Forest"
  66. This release introduces the VP9 codec in a backward-compatible way.
  67. All existing users of VP8 can continue to use the library without
  68. modification. However, some VP8 options do not map to VP9 in the same manner.
  69. The VP9 encoder in this release is not feature complete. Users interested in
  70. the encoder are advised to use the git master branch and discuss issues on
  71. libvpx mailing lists.
  72. - Upgrading:
  73. This release is ABI and API compatible with Duclair (v1.0.0). Users
  74. of older releases should refer to the Upgrading notes in this document
  75. for that release.
  76. - Enhancements:
  77. Get rid of bashisms in the main build scripts
  78. Added usage info on command line options
  79. Add lossless compression mode
  80. Dll build of libvpx
  81. Add additional Mac OS X targets: 10.7, 10.8 and 10.9 (darwin11-13)
  82. Add option to disable documentation
  83. configure: add --enable-external-build support
  84. make: support V=1 as short form of verbose=yes
  85. configure: support mingw-w64
  86. configure: support hardfloat armv7 CHOSTS
  87. configure: add support for android x86
  88. Add estimated completion time to vpxenc
  89. Don't exit on decode errors in vpxenc
  90. vpxenc: support scaling prior to encoding
  91. vpxdec: support scaling output
  92. vpxenc: improve progress indicators with --skip
  93. msvs: Don't link to winmm.lib
  94. Add a new script for producing vcxproj files
  95. Produce Visual Studio 10 and 11 project files
  96. Produce Windows Phone project files
  97. msvs-build: use msbuild for vs >= 2005
  98. configure: default configure log to config.log
  99. Add encoding option --static-thresh
  100. - Speed:
  101. Miscellaneous speed optimizations for VP8 and VP9.
  102. - Quality:
  103. In general, quality is consistent with the Eider release.
  104. - Bug Fixes:
  105. This release represents approximately a year of engineering effort,
  106. and contains multiple bug fixes. Please refer to git history for details.
  107. 2012-12-21 v1.2.0
  108. This release acts as a checkpoint for a large amount of internal refactoring
  109. and testing. It also contains a number of small bugfixes, so all users are
  110. encouraged to upgrade.
  111. - Upgrading:
  112. This release is ABI and API compatible with Duclair (v1.0.0). Users
  113. of older releases should refer to the Upgrading notes in this
  114. document for that release.
  115. - Enhancements:
  116. VP8 optimizations for MIPS dspr2
  117. vpxenc: add -quiet option
  118. - Speed:
  119. Encoder and decoder speed is consistent with the Eider release.
  120. - Quality:
  121. In general, quality is consistent with the Eider release.
  122. Minor tweaks to ARNR filtering
  123. Minor improvements to real time encoding with multiple temporal layers
  124. - Bug Fixes:
  125. Fixes multithreaded encoder race condition in loopfilter
  126. Fixes multi-resolution threaded encoding
  127. Fix potential encoder dead-lock after picture resize
  128. 2012-05-09 v1.1.0 "Eider"
  129. This introduces a number of enhancements, mostly focused on real-time
  130. encoding. In addition, it fixes a decoder bug (first introduced in
  131. Duclair) so all users of that release are encouraged to upgrade.
  132. - Upgrading:
  133. This release is ABI and API compatible with Duclair (v1.0.0). Users
  134. of older releases should refer to the Upgrading notes in this
  135. document for that release.
  136. This release introduces a new temporal denoiser, controlled by the
  137. VP8E_SET_NOISE_SENSITIVITY control. The temporal denoiser does not
  138. currently take a strength parameter, so the control is effectively
  139. a boolean - zero (off) or non-zero (on). For compatibility with
  140. existing applications, the values accepted are the same as those
  141. for the spatial denoiser (0-6). The temporal denoiser is enabled
  142. by default, and the older spatial denoiser may be restored by
  143. configuring with --disable-temporal-denoising. The temporal denoiser
  144. is more computationally intensive than the spatial one.
  145. This release removes support for a legacy, decode only API that was
  146. supported, but deprecated, at the initial release of libvpx
  147. (v0.9.0). This is not expected to have any impact. If you are
  148. impacted, you can apply a reversion to commit 2bf8fb58 locally.
  149. Please update to the latest libvpx API if you are affected.
  150. - Enhancements:
  151. Adds a motion compensated temporal denoiser to the encoder, which
  152. gives higher quality than the older spatial denoiser. (See above
  153. for notes on upgrading).
  154. In addition, support for new compilers and platforms were added,
  155. including:
  156. improved support for XCode
  157. Android x86 NDK build
  158. OS/2 support
  159. SunCC support
  160. Changing resolution with vpx_codec_enc_config_set() is now
  161. supported. Previously, reinitializing the codec was required to
  162. change the input resolution.
  163. The vpxenc application has initial support for producing multiple
  164. encodes from the same input in one call. Resizing is not yet
  165. supported, but varying other codec parameters is. Use -- to
  166. delineate output streams. Options persist from one stream to the
  167. next.
  168. Also, the vpxenc application will now use a keyframe interval of
  169. 5 seconds by default. Use the --kf-max-dist option to override.
  170. - Speed:
  171. Decoder performance improved 2.5% versus Duclair. Encoder speed is
  172. consistent with Duclair for most material. Two pass encoding of
  173. slideshow-like material will see significant improvements.
  174. Large realtime encoding speed gains at a small quality expense are
  175. possible by configuring the on-the-fly bitpacking experiment with
  176. --enable-onthefly-bitpacking. Realtime encoder can be up to 13%
  177. faster (ARM) depending on the number of threads and bitrate
  178. settings. This technique sees constant gain over the 5-16 speed
  179. range. For VC style input the loss seen is up to 0.2dB. See commit
  180. 52cf4dca for further details.
  181. - Quality:
  182. On the whole, quality is consistent with the Duclair release. Some
  183. tweaks:
  184. Reduced blockiness in easy sections by applying a penalty to
  185. intra modes.
  186. Improved quality of static sections (like slideshows) with
  187. two pass encoding.
  188. Improved keyframe sizing with multiple temporal layers
  189. - Bug Fixes:
  190. Corrected alt-ref contribution to frame rate for visible updates
  191. to the alt-ref buffer. This affected applications making manual
  192. usage of the frame reference flags, or temporal layers.
  193. Additional constraints were added to disable multi-frame quality
  194. enhancement (MFQE) in sections of the frame where there is motion.
  195. (#392)
  196. Fixed corruption issues when vpx_codec_enc_config_set() was called
  197. with spatial resampling enabled.
  198. Fixed a decoder error introduced in Duclair where the segmentation
  199. map was not being reinitialized on keyframes (#378)
  200. 2012-01-27 v1.0.0 "Duclair"
  201. Our fourth named release, focused on performance and features related to
  202. real-time encoding. It also fixes a decoder crash bug introduced in
  203. v0.9.7, so all users of that release are encouraged to upgrade.
  204. - Upgrading:
  205. This release is ABI incompatible with prior releases of libvpx, so the
  206. "major" version number has been bumped to 1. You must recompile your
  207. applications against the latest version of the libvpx headers. The
  208. API remains compatible, and this should not require code changes in most
  209. applications.
  210. - Enhancements:
  211. This release introduces several substantial new features to the encoder,
  212. of particular interest to real time streaming applications.
  213. Temporal scalability allows the encoder to produce a stream that can
  214. be decimated to different frame rates, with independent rate targetting
  215. for each substream.
  216. Multiframe quality enhancement postprocessing can make visual quality
  217. more consistent in the presence of frames that are substantially
  218. different quality than the surrounding frames, as in the temporal
  219. scalability case and in some forced keyframe scenarios.
  220. Multiple-resolution encoding support allows the encoding of the
  221. same content at different resolutions faster than encoding them
  222. separately.
  223. - Speed:
  224. Optimization targets for this release included the decoder and the real-
  225. time modes of the encoder. Decoder speed on x86 has improved 10.5% with
  226. this release. Encoder improvements followed a curve where speeds 1-3
  227. improved 4.0%-1.5%, speeds 4-8 improved <1%, and speeds 9-16 improved
  228. 1.5% to 10.5%, respectively. "Best" mode speed is consistent with the
  229. Cayuga release.
  230. - Quality:
  231. Encoder quality in the single stream case is consistent with the Cayuga
  232. release.
  233. - Bug Fixes:
  234. This release fixes an OOB read decoder crash bug present in v0.9.7
  235. related to the clamping of motion vectors in SPLITMV blocks. This
  236. behavior could be triggered by corrupt input or by starting
  237. decoding from a P-frame.
  238. 2011-08-15 v0.9.7-p1 "Cayuga" patch 1
  239. This is an incremental bugfix release against Cayuga. All users of that
  240. release are strongly encouraged to upgrade.
  241. - Fix potential OOB reads (cdae03a)
  242. An unbounded out of bounds read was discovered when the
  243. decoder was requested to perform error concealment (new in
  244. Cayuga) given a frame with corrupt partition sizes.
  245. A bounded out of bounds read was discovered affecting all
  246. versions of libvpx. Given an multipartition input frame that
  247. is truncated between the mode/mv partition and the first
  248. residiual paritition (in the block of partition offsets), up
  249. to 3 extra bytes could have been read from the source buffer.
  250. The code will not take any action regardless of the contents
  251. of these undefined bytes, as the truncated buffer is detected
  252. immediately following the read based on the calculated
  253. starting position of the coefficient partition.
  254. - Fix potential error concealment crash when the very first frame
  255. is missing or corrupt (a609be5)
  256. - Fix significant artifacts in error concealment (a4c2211, 99d870a)
  257. - Revert 1-pass CBR rate control changes (e961317)
  258. Further testing showed this change produced undesirable visual
  259. artifacts, rolling back for now.
  260. 2011-08-02 v0.9.7 "Cayuga"
  261. Our third named release, focused on a faster, higher quality, encoder.
  262. - Upgrading:
  263. This release is backwards compatible with Aylesbury (v0.9.5) and
  264. Bali (v0.9.6). Users of older releases should refer to the Upgrading
  265. notes in this document for that release.
  266. - Enhancements:
  267. Stereo 3D format support for vpxenc
  268. Runtime detection of available processor cores.
  269. Allow specifying --end-usage by enum name
  270. vpxdec: test for frame corruption
  271. vpxenc: add quantizer histogram display
  272. vpxenc: add rate histogram display
  273. Set VPX_FRAME_IS_DROPPABLE
  274. update configure for ios sdk 4.3
  275. Avoid text relocations in ARM vp8 decoder
  276. Generate a vpx.pc file for pkg-config.
  277. New ways of passing encoded data between encoder and decoder.
  278. - Speed:
  279. This release includes across-the-board speed improvements to the
  280. encoder. On x86, these measure at approximately 11.5% in Best mode,
  281. 21.5% in Good mode (speed 0), and 22.5% in Realtime mode (speed 6).
  282. On ARM Cortex A9 with Neon extensions, real-time encoding of video
  283. telephony content is 35% faster than Bali on single core and 48%
  284. faster on multi-core. On the NVidia Tegra2 platform, real time
  285. encoding is 40% faster than Bali.
  286. Decoder speed was not a priority for this release, but improved
  287. approximately 8.4% on x86.
  288. Reduce motion vector search on alt-ref frame.
  289. Encoder loopfilter running in its own thread
  290. Reworked loopfilter to precalculate more parameters
  291. SSE2/SSSE3 optimizations for build_predictors_mbuv{,_s}().
  292. Make hor UV predict ~2x faster (73 vs 132 cycles) using SSSE3.
  293. Removed redundant checks
  294. Reduced structure sizes
  295. utilize preload in ARMv6 MC/LPF/Copy routines
  296. ARM optimized quantization, dfct, variance, subtract
  297. Increase chrow row alignment to 16 bytes.
  298. disable trellis optimization for first pass
  299. Write SSSE3 sub-pixel filter function
  300. Improve SSE2 half-pixel filter funtions
  301. Add vp8_sub_pixel_variance16x8_ssse3 function
  302. Reduce unnecessary distortion computation
  303. Use diamond search to replace full search
  304. Preload reference area in sub-pixel motion search (real-time mode)
  305. - Quality:
  306. This release focused primarily on one-pass use cases, including
  307. video conferencing. Low latency data rate control was significantly
  308. improved, improving streamability over bandwidth constrained links.
  309. Added support for error concealment, allowing frames to maintain
  310. visual quality in the presence of substantial packet loss.
  311. Add rc_max_intra_bitrate_pct control
  312. Limit size of initial keyframe in one-pass.
  313. Improve framerate adaptation
  314. Improved 1-pass CBR rate control
  315. Improved KF insertion after fades to still.
  316. Improved key frame detection.
  317. Improved activity masking (lower PSNR impact for same SSIM boost)
  318. Improved interaction between GF and ARFs
  319. Adding error-concealment to the decoder.
  320. Adding support for independent partitions
  321. Adjusted rate-distortion constants
  322. - Bug Fixes:
  323. Removed firstpass motion map
  324. Fix parallel make install
  325. Fix multithreaded encoding for 1 MB wide frame
  326. Fixed iwalsh_neon build problems with RVDS4.1
  327. Fix semaphore emulation, spin-wait intrinsics on Windows
  328. Fix build with xcode4 and simplify GLOBAL.
  329. Mark ARM asm objects as allowing a non-executable stack.
  330. Fix vpxenc encoding incorrect webm file header on big endian
  331. 2011-03-07 v0.9.6 "Bali"
  332. Our second named release, focused on a faster, higher quality, encoder.
  333. - Upgrading:
  334. This release is backwards compatible with Aylesbury (v0.9.5). Users
  335. of older releases should refer to the Upgrading notes in this
  336. document for that release.
  337. - Enhancements:
  338. vpxenc --psnr shows a summary when encode completes
  339. --tune=ssim option to enable activity masking
  340. improved postproc visualizations for development
  341. updated support for Apple iOS to SDK 4.2
  342. query decoder to determine which reference frames were updated
  343. implemented error tracking in the decoder
  344. fix pipe support on windows
  345. - Speed:
  346. Primary focus was on good quality mode, speed 0. Average improvement
  347. on x86 about 40%, up to 100% on user-generated content at that speed.
  348. Best quality mode speed improved 35%, and realtime speed 10-20%. This
  349. release also saw significant improvement in realtime encoding speed
  350. on ARM platforms.
  351. Improved encoder threading
  352. Dont pick encoder filter level when loopfilter is disabled.
  353. Avoid double copying of key frames into alt and golden buffer
  354. FDCT optimizations.
  355. x86 sse2 temporal filter
  356. SSSE3 version of fast quantizer
  357. vp8_rd_pick_best_mbsegmentation code restructure
  358. Adjusted breakout RD for SPLITMV
  359. Changed segmentation check order
  360. Improved rd_pick_intra4x4block
  361. Adds armv6 optimized variance calculation
  362. ARMv6 optimized sad16x16
  363. ARMv6 optimized half pixel variance calculations
  364. Full search SAD function optimization in SSE4.1
  365. Improve MV prediction accuracy to achieve performance gain
  366. Improve MV prediction in vp8_pick_inter_mode() for speed>3
  367. - Quality:
  368. Best quality mode improved PSNR 6.3%, and SSIM 6.1%. This release
  369. also includes support for "activity masking," which greatly improves
  370. SSIM at the expense of PSNR. For now, this feature is available with
  371. the --tune=ssim option. Further experimentation in this area
  372. is ongoing. This release also introduces a new rate control mode
  373. called "CQ," which changes the allocation of bits within a clip to
  374. the sections where they will have the most visual impact.
  375. Tuning for the more exact quantizer.
  376. Relax rate control for last few frames
  377. CQ Mode
  378. Limit key frame quantizer for forced key frames.
  379. KF/GF Pulsing
  380. Add simple version of activity masking.
  381. make rdmult adaptive for intra in quantizer RDO
  382. cap the best quantizer for 2nd order DC
  383. change the threshold of DC check for encode breakout
  384. - Bug Fixes:
  385. Fix crash on Sparc Solaris.
  386. Fix counter of fixed keyframe distance
  387. ARNR filter pointer update bug fix
  388. Fixed use of motion percentage in KF/GF group calc
  389. Changed condition for using RD in Intra Mode
  390. Fix encoder real-time only configuration.
  391. Fix ARM encoder crash with multiple token partitions
  392. Fixed bug first cluster timecode of webm file is wrong.
  393. Fixed various encoder bugs with odd-sized images
  394. vp8e_get_preview fixed when spatial resampling enabled
  395. quantizer: fix assertion in fast quantizer path
  396. Allocate source buffers to be multiples of 16
  397. Fix for manual Golden frame frequency
  398. Fix drastic undershoot in long form content
  399. 2010-10-28 v0.9.5 "Aylesbury"
  400. Our first named release, focused on a faster decoder, and a better encoder.
  401. - Upgrading:
  402. This release incorporates backwards-incompatible changes to the
  403. ivfenc and ivfdec tools. These tools are now called vpxenc and vpxdec.
  404. vpxdec
  405. * the -q (quiet) option has been removed, and replaced with
  406. -v (verbose). the output is quiet by default. Use -v to see
  407. the version number of the binary.
  408. * The default behavior is now to write output to a single file
  409. instead of individual frames. The -y option has been removed.
  410. Y4M output is the default.
  411. * For raw I420/YV12 output instead of Y4M, the --i420 or --yv12
  412. options must be specified.
  413. $ ivfdec -o OUTPUT INPUT
  414. $ vpxdec --i420 -o OUTPUT INPUT
  415. * If an output file is not specified, the default is to write
  416. Y4M to stdout. This makes piping more natural.
  417. $ ivfdec -y -o - INPUT | ...
  418. $ vpxdec INPUT | ...
  419. * The output file has additional flexibility for formatting the
  420. filename. It supports escape characters for constructing a
  421. filename from the width, height, and sequence number. This
  422. replaces the -p option. To get the equivalent:
  423. $ ivfdec -p frame INPUT
  424. $ vpxdec --i420 -o frame-%wx%h-%4.i420 INPUT
  425. vpxenc
  426. * The output file must be specified with -o, rather than as the
  427. last argument.
  428. $ ivfenc <options> INPUT OUTPUT
  429. $ vpxenc <options> -o OUTPUT INPUT
  430. * The output defaults to webm. To get IVF output, use the --ivf
  431. option.
  432. $ ivfenc <options> INPUT OUTPUT.ivf
  433. $ vpxenc <options> -o OUTPUT.ivf --ivf INPUT
  434. - Enhancements:
  435. ivfenc and ivfdec have been renamed to vpxenc, vpxdec.
  436. vpxdec supports .webm input
  437. vpxdec writes .y4m by default
  438. vpxenc writes .webm output by default
  439. vpxenc --psnr now shows the average/overall PSNR at the end
  440. ARM platforms now support runtime cpu detection
  441. vpxdec visualizations added for motion vectors, block modes, references
  442. vpxdec now silent by default
  443. vpxdec --progress shows frame-by-frame timing information
  444. vpxenc supports the distinction between --fps and --timebase
  445. NASM is now a supported assembler
  446. configure: enable PIC for shared libs by default
  447. configure: add --enable-small
  448. configure: support for ppc32-linux-gcc
  449. configure: support for sparc-solaris-gcc
  450. - Bugs:
  451. Improve handling of invalid frames
  452. Fix valgrind errors in the NEON loop filters.
  453. Fix loopfilter delta zero transitions
  454. Fix valgrind errors in vp8_sixtap_predict8x4_armv6().
  455. Build fixes for darwin-icc
  456. - Speed:
  457. 20-40% (average 28%) improvement in libvpx decoder speed,
  458. including:
  459. Rewrite vp8_short_walsh4x4_sse2()
  460. Optimizations on the loopfilters.
  461. Miscellaneous improvements for Atom
  462. Add 4-tap version of 2nd-pass ARMv6 MC filter.
  463. Improved multithread utilization
  464. Better instruction choices on x86
  465. reorder data to use wider instructions
  466. Update NEON wide idcts
  467. Make block access to frame buffer sequential
  468. Improved subset block search
  469. Bilinear subpixel optimizations for ssse3.
  470. Decrease memory footprint
  471. Encoder speed improvements (percentage gain not measured):
  472. Skip unnecessary search of identical frames
  473. Add SSE2 subtract functions
  474. Improve bounds checking in vp8_diamond_search_sadx4()
  475. Added vp8_fast_quantize_b_sse2
  476. - Quality:
  477. Over 7% overall PSNR improvement (6.3% SSIM) in "best" quality
  478. encoding mode, and up to 60% improvement on very noisy, still
  479. or slow moving source video
  480. Motion compensated temporal filter for Alt-Ref Noise Reduction
  481. Improved use of trellis quantization on 2nd order Y blocks
  482. Tune effect of motion on KF/GF boost in two pass
  483. Allow coefficient optimization for good quality speed 0.
  484. Improved control of active min quantizer for two pass.
  485. Enable ARFs for non-lagged compress
  486. 2010-09-02 v0.9.2
  487. - Enhancements:
  488. Disable frame dropping by default
  489. Improved multithreaded performance
  490. Improved Force Key Frame Behaviour
  491. Increased rate control buffer level precision
  492. Fix bug in 1st pass motion compensation
  493. ivfenc: correct fixed kf interval, --disable-kf
  494. - Speed:
  495. Changed above and left context data layout
  496. Rework idct calling structure.
  497. Removed unnecessary MB_MODE_INFO copies
  498. x86: SSSE3 sixtap prediction
  499. Reworked IDCT to include reconstruction (add) step
  500. Swap alt/gold/new/last frame buffer ptrs instead of copying.
  501. Improve SSE2 loopfilter functions
  502. Change bitreader to use a larger window.
  503. Avoid loopfilter reinitialization when possible
  504. - Quality:
  505. Normalize quantizer's zero bin and rounding factors
  506. Add trellis quantization.
  507. Make the quantizer exact.
  508. Updates to ARNR filtering algorithm
  509. Fix breakout thresh computation for golden & AltRef frames
  510. Redo the forward 4x4 dct
  511. Improve the accuracy of forward walsh-hadamard transform
  512. Further adjustment of RD behaviour with Q and Zbin.
  513. - Build System:
  514. Allow linking of libs built with MinGW to MSVC
  515. Fix target auto-detection on mingw32
  516. Allow --cpu= to work for x86.
  517. configure: pass original arguments through to make dist
  518. Fix builds without runtime CPU detection
  519. msvs: fix install of codec sources
  520. msvs: Change devenv.com command line for better msys support
  521. msvs: Add vs9 targets.
  522. Add x86_64-linux-icc target
  523. - Bugs:
  524. Potential crashes on older MinGW builds
  525. Fix two-pass framrate for Y4M input.
  526. Fixed simple loop filter, other crashes on ARM v6
  527. arm: fix missing dependency with --enable-shared
  528. configure: support directories containing .o
  529. Replace pinsrw (SSE) with MMX instructions
  530. apple: include proper mach primatives
  531. Fixed rate control bug with long key frame interval.
  532. Fix DSO link errors on x86-64 when not using a version script
  533. Fixed buffer selection for UV in AltRef filtering
  534. 2010-06-17 v0.9.1
  535. - Enhancements:
  536. * ivfenc/ivfdec now support YUV4MPEG2 input and pipe I/O
  537. * Speed optimizations
  538. - Bugfixes:
  539. * Rate control
  540. * Prevent out-of-bounds accesses on invalid data
  541. - Build system updates:
  542. * Detect toolchain to be used automatically for native builds
  543. * Support building shared libraries
  544. * Better autotools emulation (--prefix, --libdir, DESTDIR)
  545. - Updated LICENSE
  546. * http://webmproject.blogspot.com/2010/06/changes-to-webm-open-source-license.html
  547. 2010-05-18 v0.9.0
  548. - Initial open source release. Welcome to WebM and VP8!