<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
<html>
<head>
<title>LuaJIT Change History</title>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<meta name="Author" content="Mike Pall">
<meta name="Copyright" content="Copyright (C) 2005-2010, Mike Pall">
<meta name="Language" content="en">
<link rel="stylesheet" type="text/css" href="bluequad.css" media="screen">
<link rel="stylesheet" type="text/css" href="bluequad-print.css" media="print">
<style type="text/css">
div.major { max-width: 600px; padding: 1em; margin: 1em 0 1em 0; }
</style>
</head>
<body>
<div id="site">
<a href="http://luajit.org"><span>Lua<span id="logo">JIT</span></span></a>
</div>
<div id="head">
<h1>LuaJIT Change History</h1>
</div>
<div id="nav">
<ul><li>
<a href="luajit.html">LuaJIT</a>
<ul><li>
<a href="install.html">Installation</a>
</li><li>
<a href="running.html">Running</a>
</li><li>
<a href="api.html">API Extensions</a>
</li></ul>
</li><li>
<a href="status.html">Status</a>
<ul><li>
<a class="current" href="changes.html">Changes</a>
</li></ul>
</li><li>
<a href="faq.html">FAQ</a>
</li><li>
<a href="http://luajit.org/download.html">Download <span class="ext">&raquo;</span></a>
</li></ul>
</div>
<div id="main">
<p>
This is a list of changes between the released versions of LuaJIT.<br>
The current <span style="color: #c00000;">development version</span> is <strong>LuaJIT&nbsp;2.0.0-beta2</strong>.<br>
The current <span style="color: #0000c0;">stable version</span> is <strong>LuaJIT&nbsp;1.1.5</strong>.
</p>
<p>
Please check the
<a href="http://luajit.org/changes.html"><span class="ext">&raquo;</span>&nbsp;Online Change History</a>
to see whether newer versions are available.
</p>
<div class="major" style="background: #d0d0d0;">
<h2 id="snap">Development Snapshot</h2>
<ul>
<li>CPU support:
<ul>
<li>Port integrated memory allocator to Linux/x64 and Windows/x64.</li>
<li>Port interpreter and JIT compiler to x64.</li>
<li>Port DynASM to x64.</li>
<li>Many 32/64-bit cleanups in the VM.</li>
<li>Allow building the interpreter with either x87 or SSE2 arithmetic.</li>
<li>Disable JIT compiler on older non-SSE2 CPUs instead of aborting.</li>
</ul></li>
<li>Correctness and completeness:
<ul>
<li>Fix constructor bytecode generation for certain conditional values.</li>
<li>Fix some cases of ordered string comparisons.</li>
<li>Fix <tt>lua_tocfunction()</tt>.</li>
<li>Fix cutoff register in JMP bytecode for some conditional expressions.</li>
<li>Fix PHI marking algorithm for references from variant slots.</li>
<li>Fix <tt>package.cpath</tt> for non-default PREFIX.</li>
<li>Fix DWARF2 frame unwind information for interpreter on OSX.</li>
<li>Drive the GC forward on string allocations in the parser.</li>
<li>Implement call/return hooks (zero-cost if disabled).</li>
<li>Implement yield from C hooks.</li>
<li>Add external unwinding and C++ exception interop (default on x64).</li>
</ul></li>
<li>Structural and performance enhancements:
<ul>
<li>Improve heuristics for bytecode penalties and blacklisting.</li>
<li>Split CALL/FUNC recording and clean up fast function call semantics.</li>
<li>Major redesign of internal function call handling.</li>
<li>Improve FOR loop const specialization and integerness checks.</li>
<li>Switch to pre-initialized stacks. Avoid frame-clearing.</li>
<li>Colocation of prototypes and related data: bytecode, constants, debug info.</li>
<li>Clean up parser and streamline bytecode generation.</li>
<li>Add support for weak IR references to register allocator.</li>
<li>Switch to compressed, extensible snapshots.</li>
<li>Compile returns to frames below the start frame.</li>
<li>Improve alias analysis of upvalues using a disambiguation hash value.</li>
<li>Compile floor/ceil/trunc to SSE2 helper calls or SSE4.1 instructions.</li>
<li>Add generic C call handling to IR and backend.</li>
<li>Improve KNUM fuse vs. load heuristics.</li>
<li>Compile various <tt>io.*()</tt> functions.</li>
<li>Compile <tt>math.sinh()</tt>, <tt>math.cosh()</tt>, <tt>math.tanh()</tt>
and <tt>math.random()</tt>.</li>
</ul></li>
</ul>
</div>
<div class="major" style="background: #ffd0d0;">
<h2 id="LuaJIT-2.0.0-beta2">LuaJIT 2.0.0-beta2 &mdash; 2009-11-09</h2>
<ul>
<li>Reorganize build system. Build static+shared library on POSIX.</li>
<li>Allow C++ exception conversion on all platforms
using a wrapper function.</li>
<li>Automatically catch C++ exceptions and rethrow Lua error
(DWARF2 only).</li>
<li>Check for the correct x87 FPU precision at strategic points.</li>
<li>Always use wrappers for libm functions.</li>
<li>Resurrect metamethod name strings before copying them.</li>
<li>Mark current trace, even if compiler is idle.</li>
<li>Ensure FILE metatable is created only once.</li>
<li>Fix type comparisons when different integer types are involved.</li>
<li>Fix <tt>getmetatable()</tt> recording.</li>
<li>Fix TDUP with dead keys in template table.</li>
<li><tt>jit.flush(tr)</tt> returns status.
Prevent manual flush of a trace that's still linked.</li>
<li>Improve register allocation heuristics for invariant references.</li>
<li>Compile the push/pop variants of <tt>table.insert()</tt> and
<tt>table.remove()</tt>.</li>
<li>Compatibility with MSVC <tt>link&nbsp;/debug</tt>.</li>
<li>Fix <tt>lua_iscfunction()</tt>.</li>
<li>Fix <tt>math.random()</tt> when compiled with <tt>-fpic</tt> (OSX).</li>
<li>Fix <tt>table.maxn()</tt>.</li>
<li>Bump <tt>MACOSX_DEPLOYMENT_TARGET</tt> to <tt>10.4</tt>.</li>
<li><tt>luaL_check*()</tt> and <tt>luaL_opt*()</tt> now support
negative arguments, too.<br>
This matches the behavior of Lua 5.1, but not the specification.</li>
</ul>
<h2 id="LuaJIT-2.0.0-beta1">LuaJIT 2.0.0-beta1 &mdash; 2009-10-31</h2>
<ul>
<li>This is the first public release of LuaJIT 2.0.</li>
<li>The whole VM has been rewritten from the ground up, so there's
no point in listing differences over earlier versions.</li>
</ul>
</div>
<div class="major" style="background: #d0d0ff;">
<h2 id="LuaJIT-1.1.5">LuaJIT 1.1.5 &mdash; 2008-10-25</h2>
<ul>
<li>Merged with Lua 5.1.4. Fixes all
<a href="http://www.lua.org/bugs.html#5.1.3"><span class="ext">&raquo;</span>&nbsp;known bugs in Lua 5.1.3</a>.</li>
</ul>
<h2 id="LuaJIT-1.1.4">LuaJIT 1.1.4 &mdash; 2008-02-05</h2>
<ul>
<li>Merged with Lua 5.1.3. Fixes all
<a href="http://www.lua.org/bugs.html#5.1.2"><span class="ext">&raquo;</span>&nbsp;known bugs in Lua 5.1.2</a>.</li>
<li>Fixed possible (but unlikely) stack corruption while compiling
<tt>k^x</tt> expressions.</li>
<li>Fixed DynASM template for cmpss instruction.</li>
</ul>
<h2 id="LuaJIT-1.1.3">LuaJIT 1.1.3 &mdash; 2007-05-24</h2>
<ul>
<li>Merged with Lua 5.1.2. Fixes all
<a href="http://www.lua.org/bugs.html#5.1.1"><span class="ext">&raquo;</span>&nbsp;known bugs in Lua 5.1.1</a>.</li>
<li>Merged pending Lua 5.1.x fixes: "return -nil" bug, spurious count hook call.</li>
<li>Remove a (sometimes) wrong assertion in <tt>luaJIT_findpc()</tt>.</li>
<li>DynASM now allows labels for displacements and <tt>.aword</tt>.</li>
<li>Fix some compiler warnings for DynASM glue (internal API change).</li>
<li>Correct naming for SSSE3 (temporarily known as SSE4) in DynASM and x86 disassembler.</li>
<li>The loadable debug modules now handle redirection to stdout
(e.g. <tt>-j&nbsp;trace=-</tt>).</li>
</ul>
<h2 id="LuaJIT-1.1.2">LuaJIT 1.1.2 &mdash; 2006-06-24</h2>
<ul>
<li>Fix MSVC inline assembly: use only local variables with
<tt>lua_number2int()</tt>.</li>
<li>Fix "attempt to call a thread value" bug on Mac OS X:
make values of consts used as lightuserdata keys unique
to avoid joining by the compiler/linker.</li>
</ul>
<h2 id="LuaJIT-1.1.1">LuaJIT 1.1.1 &mdash; 2006-06-20</h2>
<ul>
<li>Merged with Lua 5.1.1. Fixes all
<a href="http://www.lua.org/bugs.html#5.1"><span class="ext">&raquo;</span>&nbsp;known bugs in Lua 5.1</a>.</li>
<li>Enforce (dynamic) linker error for EXE/DLL version mismatches.</li>
<li>Minor changes to DynASM: faster preprocessing, smaller encoding
for some immediates.</li>
</ul>
<p>
This release is in sync with Coco 1.1.1 (see the
<a href="http://coco.luajit.org/changes.html"><span class="ext">&raquo;</span>&nbsp;Coco Change History</a>).
</p>
<h2 id="LuaJIT-1.1.0">LuaJIT 1.1.0 &mdash; 2006-03-13</h2>
<ul>
<li>Merged with Lua 5.1 (final).</li>
<li>New JIT call frame setup:
<ul>
<li>The C stack is kept 16-byte aligned (faster).
Mandatory for Mac OS X on Intel, too.</li>
<li>Faster calling conventions for internal C helper functions.</li>
<li>Better instruction scheduling for function prologue, OP_CALL and
OP_RETURN.</li>
</ul></li>
<li>Miscellaneous optimizations:
<ul>
<li>Faster loads of FP constants. Remove narrow-to-wide store-to-load
forwarding stalls.</li>
<li>Use (scalar) SSE2 ops (if the CPU supports it) to speed up slot moves
and FP to integer conversions.</li>
<li>Optimized the two-argument form of <tt>OP_CONCAT</tt> (<tt>a..b</tt>).</li>
<li>Inlined <tt>OP_MOD</tt> (<tt>a%b</tt>).
With better accuracy than the C variant, too.</li>
<li>Inlined <tt>OP_POW</tt> (<tt>a^b</tt>). Unroll <tt>x^k</tt> or
use <tt>k^x = 2^(log2(k)*x)</tt> or call <tt>pow()</tt>.</li>
</ul></li>
<li>Changes in the optimizer:
<ul>
<li>Improved hinting for table keys derived from table values
(<tt>t1[t2[x]]</tt>).</li>
<li>Lookup hinting now works with arbitrary object types and
supports index chains, too.</li>
<li>Generate type hints for arithmetic and comparison operators,
OP_LEN, OP_CONCAT and OP_FORPREP.</li>
<li>Remove several hint definitions in favour of a generic COMBINE hint.</li>
<li>Complete rewrite of <tt>jit.opt_inline</tt> module
(formerly <tt>jit.opt_lib</tt>).</li>
</ul></li>
<li>Use adaptive deoptimization:
<ul>
<li>If runtime verification of a contract fails, the affected
instruction is recompiled and patched on-the-fly.
Regular programs will trigger deoptimization only occasionally.</li>
<li>This avoids generating code for uncommon fallback cases
most of the time. Generated code is up to 30% smaller compared to
LuaJIT&nbsp;1.0.3.</li>
<li>Deoptimization is used for many opcodes and contracts:
<ul>
<li>OP_CALL, OP_TAILCALL: type mismatch for callable.</li>
<li>Inlined calls: closure mismatch, parameter number and type mismatches.</li>
<li>OP_GETTABLE, OP_SETTABLE: table or key type and range mismatches.</li>
<li>All arithmetic and comparison operators, OP_LEN, OP_CONCAT,
OP_FORPREP: operand type and range mismatches.</li>
</ul></li>
<li>Complete redesign of the debug and traceback info
(bytecode &harr; mcode) to support deoptimization.
Much more flexible and needs only 50% of the space.</li>
<li>The modules <tt>jit.trace</tt>, <tt>jit.dumphints</tt> and
<tt>jit.dump</tt> handle deoptimization.</li>
</ul></li>
<li>Inlined many popular library functions
(for commonly used arguments only):
<ul>
<li>Most <tt>math.*</tt> functions (the 18 most used ones)
[2x-10x faster].</li>
<li><tt>string.len</tt>, <tt>string.sub</tt> and <tt>string.char</tt>
[2x-10x faster].</li>
<li><tt>table.insert</tt>, <tt>table.remove</tt> and <tt>table.getn</tt>
[3x-5x faster].</li>
<li><tt>coroutine.yield</tt> and <tt>coroutine.resume</tt>
[3x-5x faster].</li>
<li><tt>pairs</tt>, <tt>ipairs</tt> and the corresponding iterators
[8x-15x faster].</li>
</ul></li>
<li>Changes in the core and loadable modules and the stand-alone executable:
<ul>
<li>Added <tt>jit.version</tt>, <tt>jit.version_num</tt>
and <tt>jit.arch</tt>.</li>
<li>Reorganized some internal API functions (<tt>jit.util.*mcode*</tt>).</li>
<li>The <tt>-j dump</tt> output now shows JSUB names, too.</li>
<li>New x86 disassembler module written in pure Lua. No dependency
on ndisasm anymore. Flexible API, very compact (500 lines)
and complete (x87, MMX, SSE, SSE2, SSE3, SSSE3, privileged instructions).</li>
<li><tt>luajit -v</tt> prints the LuaJIT version and copyright
on a separate line.</li>
</ul></li>
<li>Added SSE, SSE2, SSE3 and SSSE3 support to DynASM.</li>
<li>Miscellaneous doc changes. Added a section about
<a href="install.html#embedding">embedding LuaJIT</a>.</li>
</ul>
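<p>
The three <tt>OP_POW</tt> strategies listed above can be illustrated with
a small Python sketch. This is not LuaJIT's generated code: the function
names, the use of repeated squaring for the unrolled case and the cutoff
for "small" exponents are all assumptions made for illustration.
</p>

```python
import math


def pow_unrolled(x, k):
    """x^k for a small non-negative integer constant k.

    Uses repeated squaring as a stand-in for an unrolled
    multiplication sequence (an assumption, not LuaJIT's scheme).
    """
    result = 1.0
    base = x
    while k:
        if k % 2 == 1:
            result *= base
        base *= base
        k //= 2
    return result


def pow_const_base(k, x):
    """k^x for a positive constant k, using k^x = 2^(log2(k)*x)."""
    return 2.0 ** (math.log2(k) * x)


def power(a, b, a_is_const=False, b_is_const=False):
    """Pick one of the three strategies; the cutoff of 8 is hypothetical."""
    if b_is_const and isinstance(b, int) and b in range(0, 9):
        return pow_unrolled(a, b)
    if a_is_const and a > 0:
        return pow_const_base(a, b)
    return math.pow(a, b)  # general case: fall back to the library call
```

<p>
For example, <tt>power(3.0, 4, b_is_const=True)</tt> takes the unrolled path,
while <tt>power(10.0, x, a_is_const=True)</tt> goes through the
<tt>log2</tt> identity and may differ from <tt>pow()</tt> in the last few bits.
</p>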
<p>
This release is in sync with Coco 1.1.0 (see the
<a href="http://coco.luajit.org/changes.html"><span class="ext">&raquo;</span>&nbsp;Coco Change History</a>).
</p>
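<p>
The adaptive deoptimization introduced in 1.1.0 can be pictured with a toy
Python model: a call site starts out with a version specialized for a
contract (here: both operands are integers) and patches itself to a generic
version the first time the runtime check fails. The class and method names
are hypothetical, and the real mechanism recompiles and patches machine
code rather than swapping Python methods.
</p>

```python
class AddSite:
    """One addition site that starts out specialized for int operands."""

    def __init__(self):
        self.handler = self._fast_int_add  # optimistic, specialized version
        self.deopt_count = 0

    def _fast_int_add(self, a, b):
        # Contract: both operands are ints. It is verified at runtime.
        if type(a) is int and type(b) is int:
            return a + b
        # Contract failed: patch this site to the generic version
        # and redo the operation there.
        self.deopt_count += 1
        self.handler = self._generic_add
        return self.handler(a, b)

    def _generic_add(self, a, b):
        # Fallback that handles any operands supporting "+".
        return a + b

    def __call__(self, a, b):
        return self.handler(a, b)
```

<p>
Calling <tt>site = AddSite()</tt> and then <tt>site(1, 2)</tt> stays on the
specialized path. The first call with a non-integer operand, such as
<tt>site(1.5, 2)</tt>, triggers the one-time patch, and later calls go
straight to the generic version.
</p>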
</div>
<div class="major" style="background: #ffffd0;">
<h2 id="LuaJIT-1.0.3">LuaJIT 1.0.3 &mdash; 2005-09-08</h2>
<ul>
<li>Even more docs.</li>
<li>Unified closure checks in <tt>jit.*</tt>.</li>
<li>Fixed some range checks in <tt>jit.util.*</tt>.</li>
<li>Fixed __newindex call originating from <tt>jit_settable_str()</tt>.</li>
<li>Merged with Lua 5.1 alpha (including early bugfixes).</li>
</ul>
<p>
This is the first public release of LuaJIT.
</p>
<h2 id="LuaJIT-1.0.2">LuaJIT 1.0.2 &mdash; 2005-09-02</h2>
<ul>
<li>Add support for flushing the Valgrind translation cache <br>
(<tt>MYCFLAGS= -DUSE_VALGRIND</tt>).</li>
<li>Add support for freeing executable mcode memory to the <tt>mmap()</tt>-based
variant for POSIX systems.</li>
<li>Reorganized the C&nbsp;function signature handling in
<tt>jit.opt_lib</tt>.</li>
<li>Changed to index-based hints for inlining C&nbsp;functions.
Still no support in the backend for inlining.</li>
<li>Hardcode <tt>HEAP_CREATE_ENABLE_EXECUTE</tt> value if undefined.</li>
<li>Misc. changes to the <tt>jit.*</tt> modules.</li>
<li>Misc. changes to the Makefiles.</li>
<li>Lots of new docs.</li>
<li>Complete doc reorg.</li>
</ul>
<p>
Not released because Lua 5.1 alpha came out today.
</p>
<h2 id="LuaJIT-1.0.1">LuaJIT 1.0.1 &mdash; 2005-08-31</h2>
<ul>
<li>Add missing GC step in <tt>OP_CONCAT</tt>.</li>
<li>Fix result handling for C &rarr; JIT calls.</li>
<li>Detect CPU feature bits.</li>
<li>Encode conditional moves (<tt>fucomip</tt>) only when supported.</li>
<li>Add fallback instructions for FP compares.</li>
<li>Add support for <tt>LUA_COMPAT_VARARG</tt>. Still disabled by default.</li>
<li>MSVC needs a specific place for the <tt>CALLBACK</tt> attribute
(David Burgess).</li>
<li>Misc. doc updates.</li>
</ul>
<p>
Interim non-public release.
Special thanks to Adam D. Moss for reporting most of the bugs.
</p>
<h2 id="LuaJIT-1.0.0">LuaJIT 1.0.0 &mdash; 2005-08-29</h2>
<p>
This is the initial non-public release of LuaJIT.
</p>
</div>
<br class="flush">
</div>
<div id="foot">
<hr class="hide">
Copyright &copy; 2005-2010 Mike Pall
<span class="noprint">
&middot;
<a href="contact.html">Contact</a>
</span>
</div>
</body>
</html>