status.html 9.9 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257
  1. <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
  2. <html>
  3. <head>
  4. <title>Status &amp; Roadmap</title>
  5. <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
  6. <meta name="Author" content="Mike Pall">
  7. <meta name="Copyright" content="Copyright (C) 2005-2011, Mike Pall">
  8. <meta name="Language" content="en">
  9. <link rel="stylesheet" type="text/css" href="bluequad.css" media="screen">
  10. <link rel="stylesheet" type="text/css" href="bluequad-print.css" media="print">
  11. <style type="text/css">
  12. ul li { padding-bottom: 0.3em; }
  13. </style>
  14. </head>
  15. <body>
  16. <div id="site">
  17. <a href="http://luajit.org"><span>Lua<span id="logo">JIT</span></span></a>
  18. </div>
  19. <div id="head">
  20. <h1>Status &amp; Roadmap</h1>
  21. </div>
  22. <div id="nav">
  23. <ul><li>
  24. <a href="luajit.html">LuaJIT</a>
  25. <ul><li>
  26. <a href="install.html">Installation</a>
  27. </li><li>
  28. <a href="running.html">Running</a>
  29. </li></ul>
  30. </li><li>
  31. <a href="extensions.html">Extensions</a>
  32. <ul><li>
  33. <a href="ext_ffi.html">FFI Library</a>
  34. <ul><li>
  35. <a href="ext_ffi_tutorial.html">FFI Tutorial</a>
  36. </li><li>
  37. <a href="ext_ffi_api.html">ffi.* API</a>
  38. </li><li>
  39. <a href="ext_ffi_semantics.html">FFI Semantics</a>
  40. </li></ul>
  41. </li><li>
  42. <a href="ext_jit.html">jit.* Library</a>
  43. </li><li>
  44. <a href="ext_c_api.html">Lua/C API</a>
  45. </li></ul>
  46. </li><li>
  47. <a class="current" href="status.html">Status</a>
  48. <ul><li>
  49. <a href="changes.html">Changes</a>
  50. </li></ul>
  51. </li><li>
  52. <a href="faq.html">FAQ</a>
  53. </li><li>
  54. <a href="http://luajit.org/performance.html">Performance <span class="ext">&raquo;</span></a>
  55. </li><li>
  56. <a href="http://luajit.org/download.html">Download <span class="ext">&raquo;</span></a>
  57. </li></ul>
  58. </div>
  59. <div id="main">
  60. <p>
  61. The <span style="color: #0000c0;">LuaJIT 1.x</span> series represents
  62. the current <span style="color: #0000c0;">stable branch</span>.
  63. Only a single bug has been discovered in the last two years. So, if
  64. you need a rock-solid VM, you are encouraged to fetch the latest
  65. release of LuaJIT 1.x from the <a href="http://luajit.org/download.html"><span class="ext">&raquo;</span>&nbsp;Download</a>
  66. page.
  67. </p>
  68. <p>
  69. <span style="color: #c00000;">LuaJIT 2.0</span> is the currently active
  70. <span style="color: #c00000;">development branch</span>.
  71. It has <b>Beta Test</b> status and is still undergoing
  72. substantial changes.
  73. It has <a href="http://luajit.org/performance.html"><span class="ext">&raquo;</span>&nbsp;much better performance</a> than LuaJIT 1.x.
  74. It's maturing quickly, so you should definitely
  75. start to evaluate it for new projects right now.
  76. </p>
  77. <h2>Current Status</h2>
  78. <p>
  79. This is a list of the things you should know about the LuaJIT 2.0 beta test:
  80. </p>
  81. <ul>
  82. <li>
  83. Obviously there will be many <b>bugs</b> in a VM which has been
  84. rewritten from the ground up. Please report your findings together with
  85. the circumstances needed to reproduce the bug. If possible, reduce the
  86. problem down to a simple test case.<br>
  87. There is no formal bug tracker at the moment. The best place for
  88. discussion is the
  89. <a href="http://www.lua.org/lua-l.html"><span class="ext">&raquo;</span>&nbsp;Lua mailing list</a>. Of course
  90. you may also send your bug reports <a href="contact.html">directly to me</a>,
  91. especially when they contain lengthy debug output or if you require
  92. confidentiality.
  93. </li>
  94. <li>
  95. The x86 JIT compiler only generates code for CPUs with support for
  96. <b>SSE2</b> instructions. I.e. you need at least a P4, Core 2/i3/i5/i7,
  97. Atom or K8/K10 to get the full benefit.<br>
  98. If you run LuaJIT on older CPUs without SSE2 support, the JIT compiler
  99. is disabled and the VM falls back to the LuaJIT interpreter. This is faster
  100. than the Lua interpreter, but not nearly as fast as the JIT compiler of course.
  101. Run the command line executable without arguments to show the current status
  102. (<tt>JIT: ON</tt> or <tt>JIT: OFF</tt>).
  103. </li>
  104. <li>
  105. The VM is complete in the sense that it <b>should</b> run all Lua code
  106. just fine. It's considered a serious bug if the VM crashes or produces
  107. unexpected results &mdash; please report this. There are only very few
  108. known incompatibilities with standard Lua:
  109. <ul>
  110. <li>
  111. The Lua <b>debug API</b> is missing a couple of features (return
  112. hooks for non-Lua functions) and shows slightly different behavior
  113. (no per-coroutine hooks, no tail call counting).
  114. </li>
  115. <li>
  116. <b>Bytecode</b> currently cannot be loaded or dumped. Note that
  117. the bytecode format differs from Lua&nbsp;5.1 &mdash; loading foreign
  118. bytecode is not supported at all.
  119. </li>
  120. <li>
  121. Some of the <b>configuration options</b> of Lua&nbsp;5.1 are not supported:
  122. <ul>
  123. <li>The <b>number type</b> cannot be changed (it's always a <tt>double</tt>).</li>
  124. <li>The stand-alone executable cannot be linked with <b>readline</b>
  125. to enable line editing. It's planned to add support for loading it
  126. on-demand.</li>
  127. </ul>
  128. </li>
  129. <li>
  130. Most other issues you're likely to find (e.g. with the existing test
  131. suites) are differences in the <b>implementation-defined</b> behavior.
  132. These either have a good reason (like early tail call resolving which
  133. may cause differences in error reporting), are arbitrary design choices
  134. or are due to quirks in the VM. The latter cases may get fixed if a
  135. demonstrable need is shown.
  136. </li>
  137. </ul>
  138. </li>
  139. <li>
  140. The <b>JIT compiler</b> falls back to the
  141. interpreter in some cases. All of this works transparently, so unless
  142. you use <tt>-jv</tt>, you'll probably never notice (the interpreter is
  143. <a href="http://luajit.org/performance.html"><span class="ext">&raquo;</span>&nbsp;quite fast</a>, too). Here are the known issues:
  144. <ul>
  145. <li>
  146. Most known issues cause a <b>NYI</b> (not yet implemented) trace abort
  147. message. E.g. for calls to some internal library
  148. functions. Reporting these is only mildly useful, except if you have good
  149. example code that shows the problem. Obviously, reports accompanied with
  150. a patch to fix the issue are more than welcome. But please check back
  151. with me, before writing major improvements, to avoid duplication of
  152. effort.
  153. </li>
  154. <li>
  155. The trace compiler currently doesn't back off specialization for
  156. function call dispatch. It should really fall back to specializing on
  157. the prototype, not the closure identity. This can lead to the so-called
  158. "trace explosion" problem with <b>closure-heavy programming</b>. The
  159. trace linking heuristics prevent this, but in the worst case this
  160. means the code always falls back to the interpreter.
  161. </li>
  162. <li>
  163. <b>Trace management</b> needs more tuning: less drastic countermeasures
  164. against trace explosion and better heuristics in general.
  165. </li>
  166. <li>
  167. Some checks are missing in the JIT-compiled code for obscure situations
  168. with <b>open upvalues aliasing</b> one of the SSA slots later on (or
  169. vice versa). Bonus points, if you can find a real world test case for
  170. this.
  171. </li>
  172. <li>
  173. Currently some <b>out-of-memory</b> errors from <b>on-trace code</b> are not
  174. handled correctly. The error may fall through an on-trace
  175. <tt>pcall</tt> (x86) or it may be passed on to the function set with
  176. <tt>lua_atpanic</tt> (x64).
  177. </li>
  178. </ul>
  179. </li>
  180. </ul>
  181. <h2>Roadmap</h2>
  182. <p>
  183. Please refer to the
  184. <a href="http://lua-users.org/lists/lua-l/2011-01/msg01238.html"><span class="ext">&raquo;</span>&nbsp;LuaJIT
  185. Roadmap 2011</a> for the latest release plan. Here's the general
  186. project plan for LuaJIT 2.0:
  187. </p>
  188. <ul>
  189. <li>
  190. The main goal right now is to stabilize LuaJIT 2.0 and get it out of
  191. beta test. <b>Correctness</b> has priority over completeness. This
  192. implies the first stable release will certainly NOT compile every
  193. library function call and will fall back to the interpreter from time
  194. to time. This is perfectly ok, since it still executes all Lua code,
  195. just not at the highest possible speed.
  196. </li>
  197. <li>
  198. The next step is to get it to compile more library functions and handle
  199. more cases where the compiler currently bails out. This doesn't mean it
  200. will compile every corner case. It's much more important that it
  201. performs well in a majority of use cases. Every compiler has to make
  202. these trade-offs &mdash; <b>completeness</b> just cannot be the
  203. overriding goal for a low-footprint, low-overhead JIT compiler.
  204. </li>
  205. <li>
  206. More <b>optimizations</b> will be added in parallel to the last step on
  207. an as-needed basis. Sinking of stores
  208. to aggregates and sinking of allocations are high on the list.
  209. More complex optimizations with less pay-off, such as value-range-propagation
  210. (VRP) will have to wait.
  211. </li>
  212. <li>
  213. LuaJIT 2.0 has been designed with <b>portability</b> in mind.
  214. Nonetheless, it compiles to native code and needs to be adapted to each
  215. architecture. The two major work items are porting the the fast interpreter,
  216. which is written in assembler, and porting the compiler backend.
  217. Most other portability issues like endianess or 32 vs. 64&nbsp;bit CPUs
  218. have already been taken care of.<br>
  219. Several ports are already available, thanks to the
  220. <a href="http://luajit.org/sponsors.html"><span class="ext">&raquo;</span>&nbsp;LuaJIT sponsorship program</a>.
  221. More ports will follow in the future &mdash; companies which are
  222. interested in sponsoring a port to a particular architecture, please
  223. use the given contact address.
  224. </li>
  225. <li>
  226. <b>Documentation</b> about the <b>internals</b> of LuaJIT is still sorely
  227. missing. Although the source code is included and is IMHO well
  228. commented, many basic design decisions are in need of an explanation.
  229. The rather un-traditional compiler architecture and the many highly
  230. optimized data structures are a barrier for outside participation in
  231. the development. Alas, as I've repeatedly stated, I'm better at
  232. writing code than papers and I'm not in need of any academic merits.
  233. Someday I will find the time for it. :-)
  234. </li>
  235. <li>
  236. Producing good code for unbiased branches is a key problem for trace
  237. compilers. This is the main cause for "trace explosion".
  238. <b>Hyperblock scheduling</b> promises to solve this nicely at the
  239. price of a major redesign of the compiler. This would also pave the
  240. way for emitting predicated instructions, which is a prerequisite
  241. for efficient <b>vectorization</b>.
  242. </li>
  243. </ul>
  244. <br class="flush">
  245. </div>
  246. <div id="foot">
  247. <hr class="hide">
  248. Copyright &copy; 2005-2011 Mike Pall
  249. <span class="noprint">
  250. &middot;
  251. <a href="contact.html">Contact</a>
  252. </span>
  253. </div>
  254. </body>
  255. </html>