ext_ffi_tutorial.html 22 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516517518519520521522523524525526527528529530531532533534535536537538539540541542543544545546547548549550551552553554555556557558559560561562563564565566567568569570571572573574575576577578579580581582583584585586587588589590591592593594595596597598599600601602603604605606607
  1. <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
  2. <html>
  3. <head>
  4. <title>FFI Tutorial</title>
  5. <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
  6. <meta name="Author" content="Mike Pall">
  7. <meta name="Copyright" content="Copyright (C) 2005-2012, Mike Pall">
  8. <meta name="Language" content="en">
  9. <link rel="stylesheet" type="text/css" href="bluequad.css" media="screen">
  10. <link rel="stylesheet" type="text/css" href="bluequad-print.css" media="print">
  11. <style type="text/css">
  12. span.codemark { position:absolute; left: 16em; color: #4040c0; }
  13. span.mark { color: #4040c0; font-family: Courier New, Courier, monospace;
  14. line-height: 1.1; }
  15. pre.mark { padding-left: 2em; }
  16. table.idiomtable { line-height: 1.2; }
  17. table.idiomtable tt { font-size: 100%; }
  18. table.idiomtable td { vertical-align: top; }
  19. tr.idiomhead td { font-weight: bold; }
  20. td.idiomc { width: 12em; }
  21. td.idiomlua { width: 14em; }
  22. td.idiomlua b { font-weight: normal; color: #2142bf; }
  23. </style>
  24. </head>
  25. <body>
  26. <div id="site">
  27. <a href="http://luajit.org"><span>Lua<span id="logo">JIT</span></span></a>
  28. </div>
  29. <div id="head">
  30. <h1>FFI Tutorial</h1>
  31. </div>
  32. <div id="nav">
  33. <ul><li>
  34. <a href="luajit.html">LuaJIT</a>
  35. <ul><li>
  36. <a href="install.html">Installation</a>
  37. </li><li>
  38. <a href="running.html">Running</a>
  39. </li></ul>
  40. </li><li>
  41. <a href="extensions.html">Extensions</a>
  42. <ul><li>
  43. <a href="ext_ffi.html">FFI Library</a>
  44. <ul><li>
  45. <a class="current" href="ext_ffi_tutorial.html">FFI Tutorial</a>
  46. </li><li>
  47. <a href="ext_ffi_api.html">ffi.* API</a>
  48. </li><li>
  49. <a href="ext_ffi_semantics.html">FFI Semantics</a>
  50. </li></ul>
  51. </li><li>
  52. <a href="ext_jit.html">jit.* Library</a>
  53. </li><li>
  54. <a href="ext_c_api.html">Lua/C API</a>
  55. </li></ul>
  56. </li><li>
  57. <a href="status.html">Status</a>
  58. <ul><li>
  59. <a href="changes.html">Changes</a>
  60. </li></ul>
  61. </li><li>
  62. <a href="faq.html">FAQ</a>
  63. </li><li>
  64. <a href="http://luajit.org/performance.html">Performance <span class="ext">&raquo;</span></a>
  65. </li><li>
  66. <a href="http://luajit.org/download.html">Download <span class="ext">&raquo;</span></a>
  67. </li><li>
  68. <a href="http://wiki.luajit.org/">Wiki <span class="ext">&raquo;</span></a>
  69. </li><li>
  70. <a href="http://luajit.org/list.html">Mailing List <span class="ext">&raquo;</span></a>
  71. </li></ul>
  72. </div>
  73. <div id="main">
  74. <p>
  75. This page is intended to give you an overview of the features of the FFI
  76. library by presenting a few use cases and guidelines.
  77. </p>
  78. <p>
  79. This page makes no attempt to explain all of the FFI library, though.
  80. You'll want to have a look at the <a href="ext_ffi_api.html">ffi.* API
  81. function reference</a> and the <a href="ext_ffi_semantics.html">FFI
  82. semantics</a> to learn more.
  83. </p>
  84. <h2 id="load">Loading the FFI Library</h2>
  85. <p>
  86. The FFI library is built into LuaJIT by default, but it's not loaded
  87. and initialized by default. The suggested way to use the FFI library
  88. is to add the following to the start of every Lua file that needs one
  89. of its functions:
  90. </p>
  91. <pre class="code">
  92. local ffi = require("ffi")
  93. </pre>
  94. <p>
  95. Please note this doesn't define an <tt>ffi</tt> variable in the table
  96. of globals &mdash; you really need to use the local variable. The
  97. <tt>require</tt> function ensures the library is only loaded once.
  98. </p>
  99. <p style="font-size: 8pt;">
  100. Note: If you want to experiment with the FFI from the interactive prompt
  101. of the command line executable, omit the <tt>local</tt>, as it doesn't
  102. preserve local variables across lines.
  103. </p>
  104. <h2 id="sleep">Accessing Standard System Functions</h2>
  105. <p>
  106. The following code explains how to access standard system functions.
  107. We slowly print two lines of dots by sleeping for 10&nbsp;milliseconds
  108. after each dot:
  109. </p>
  110. <pre class="code mark">
  111. <span class="codemark">&nbsp;
  112. &#9312;
  113. &#9313;
  114. &#9314;
  115. &#9315;
  116. &#9316;
  117. &#9317;</span>local ffi = require("ffi")
  118. ffi.cdef[[
  119. <span style="color:#00a000;">void Sleep(int ms);
  120. int poll(struct pollfd *fds, unsigned long nfds, int timeout);</span>
  121. ]]
  122. local sleep
  123. if ffi.os == "Windows" then
  124. function sleep(s)
  125. ffi.C.Sleep(s*1000)
  126. end
  127. else
  128. function sleep(s)
  129. ffi.C.poll(nil, 0, s*1000)
  130. end
  131. end
  132. for i=1,160 do
  133. io.write("."); io.flush()
  134. sleep(0.01)
  135. end
  136. io.write("\n")
  137. </pre>
  138. <p>
  139. Here's the step-by-step explanation:
  140. </p>
  141. <p>
  142. <span class="mark">&#9312;</span> This defines the
  143. C&nbsp;library functions we're going to use. The part inside the
  144. double-brackets (in green) is just standard C&nbsp;syntax. You can
  145. usually get this info from the C&nbsp;header files or the
  146. documentation provided by each C&nbsp;library or C&nbsp;compiler.
  147. </p>
  148. <p>
  149. <span class="mark">&#9313;</span> The difficulty we're
  150. facing here, is that there are different standards to choose from.
  151. Windows has a simple <tt>Sleep()</tt> function. On other systems there
  152. are a variety of functions available to achieve sub-second sleeps, but
  153. with no clear consensus. Thankfully <tt>poll()</tt> can be used for
  154. this task, too, and it's present on most non-Windows systems. The
  155. check for <tt>ffi.os</tt> makes sure we use the Windows-specific
  156. function only on Windows systems.
  157. </p>
  158. <p>
  159. <span class="mark">&#9314;</span> Here we're wrapping the
  160. call to the C&nbsp;function in a Lua function. This isn't strictly
  161. necessary, but it's helpful to deal with system-specific issues only
  162. in one part of the code. The way we're wrapping it ensures the check
  163. for the OS is only done during initialization and not for every call.
  164. </p>
  165. <p>
  166. <span class="mark">&#9315;</span> A more subtle point is
  167. that we defined our <tt>sleep()</tt> function (for the sake of this
  168. example) as taking the number of seconds, but accepting fractional
  169. seconds. Multiplying this by 1000 gets us milliseconds, but that still
  170. leaves it a Lua number, which is a floating-point value. Alas, the
  171. <tt>Sleep()</tt> function only accepts an integer value. Luckily for
  172. us, the FFI library automatically performs the conversion when calling
  173. the function (truncating the FP value towards zero, like in C).
  174. </p>
  175. <p style="font-size: 8pt;">
  176. Some readers will notice that <tt>Sleep()</tt> is part of
  177. <tt>KERNEL32.DLL</tt> and is also a <tt>stdcall</tt> function. So how
  178. can this possibly work? The FFI library provides the <tt>ffi.C</tt>
  179. default C&nbsp;library namespace, which allows calling functions from
  180. the default set of libraries, like a C&nbsp;compiler would. Also, the
  181. FFI library automatically detects <tt>stdcall</tt> functions, so you
  182. don't need to declare them as such.
  183. </p>
  184. <p>
  185. <span class="mark">&#9316;</span> The <tt>poll()</tt>
  186. function takes a couple more arguments we're not going to use. You can
  187. simply use <tt>nil</tt> to pass a <tt>NULL</tt> pointer and <tt>0</tt>
  188. for the <tt>nfds</tt> parameter. Please note that the
  189. number&nbsp;<tt>0</tt> <em>does not convert to a pointer value</em>,
  190. unlike in C++. You really have to pass pointers to pointer arguments
  191. and numbers to number arguments.
  192. </p>
  193. <p style="font-size: 8pt;">
  194. The page on <a href="ext_ffi_semantics.html">FFI semantics</a> has all
  195. of the gory details about
  196. <a href="ext_ffi_semantics.html#convert">conversions between Lua
  197. objects and C&nbsp;types</a>. For the most part you don't have to deal
  198. with this, as it's performed automatically and it's carefully designed
  199. to bridge the semantic differences between Lua and C.
  200. </p>
  201. <p>
  202. <span class="mark">&#9317;</span> Now that we have defined
  203. our own <tt>sleep()</tt> function, we can just call it from plain Lua
  204. code. That wasn't so bad, huh? Turning these boring animated dots into
  205. a fascinating best-selling game is left as an exercise for the reader.
  206. :-)
  207. </p>
  208. <h2 id="zlib">Accessing the zlib Compression Library</h2>
  209. <p>
  210. The following code shows how to access the <a
  211. href="http://zlib.net/">zlib</a> compression library from Lua code.
  212. We'll define two convenience wrapper functions that take a string and
  213. compress or uncompress it to another string:
  214. </p>
  215. <pre class="code mark">
  216. <span class="codemark">&nbsp;
  217. &#9312;
  218. &#9313;
  219. &#9314;
  220. &#9315;
  221. &#9316;
  222. &#9317;
  223. &#9318;</span>local ffi = require("ffi")
  224. ffi.cdef[[
  225. <span style="color:#00a000;">unsigned long compressBound(unsigned long sourceLen);
  226. int compress2(uint8_t *dest, unsigned long *destLen,
  227. const uint8_t *source, unsigned long sourceLen, int level);
  228. int uncompress(uint8_t *dest, unsigned long *destLen,
  229. const uint8_t *source, unsigned long sourceLen);</span>
  230. ]]
  231. local zlib = ffi.load(ffi.os == "Windows" and "zlib1" or "z")
  232. local function compress(txt)
  233. local n = zlib.compressBound(#txt)
  234. local buf = ffi.new("uint8_t[?]", n)
  235. local buflen = ffi.new("unsigned long[1]", n)
  236. local res = zlib.compress2(buf, buflen, txt, #txt, 9)
  237. assert(res == 0)
  238. return ffi.string(buf, buflen[0])
  239. end
  240. local function uncompress(comp, n)
  241. local buf = ffi.new("uint8_t[?]", n)
  242. local buflen = ffi.new("unsigned long[1]", n)
  243. local res = zlib.uncompress(buf, buflen, comp, #comp)
  244. assert(res == 0)
  245. return ffi.string(buf, buflen[0])
  246. end
  247. -- Simple test code.
  248. local txt = string.rep("abcd", 1000)
  249. print("Uncompressed size: ", #txt)
  250. local c = compress(txt)
  251. print("Compressed size: ", #c)
  252. local txt2 = uncompress(c, #txt)
  253. assert(txt2 == txt)
  254. </pre>
  255. <p>
  256. Here's the step-by-step explanation:
  257. </p>
  258. <p>
  259. <span class="mark">&#9312;</span> This defines some of the
  260. C&nbsp;functions provided by zlib. For the sake of this example, some
  261. type indirections have been reduced and it uses the pre-defined
  262. fixed-size integer types, while still adhering to the zlib API/ABI.
  263. </p>
  264. <p>
  265. <span class="mark">&#9313;</span> This loads the zlib shared
  266. library. On POSIX systems it's named <tt>libz.so</tt> and usually
  267. comes pre-installed. Since <tt>ffi.load()</tt> automatically adds any
  268. missing standard prefixes/suffixes, we can simply load the
  269. <tt>"z"</tt> library. On Windows it's named <tt>zlib1.dll</tt> and
  270. you'll have to download it first from the
  271. <a href="http://zlib.net/"><span class="ext">&raquo;</span>&nbsp;zlib site</a>. The check for
  272. <tt>ffi.os</tt> makes sure we pass the right name to
  273. <tt>ffi.load()</tt>.
  274. </p>
  275. <p>
  276. <span class="mark">&#9314;</span> First, the maximum size of
  277. the compression buffer is obtained by calling the
  278. <tt>zlib.compressBound</tt> function with the length of the
  279. uncompressed string. The next line allocates a byte buffer of this
  280. size. The <tt>[?]</tt> in the type specification indicates a
  281. variable-length array (VLA). The actual number of elements of this
  282. array is given as the 2nd argument to <tt>ffi.new()</tt>.
  283. </p>
  284. <p>
  285. <span class="mark">&#9315;</span> This may look strange at
  286. first, but have a look at the declaration of the <tt>compress2</tt>
  287. function from zlib: the destination length is defined as a pointer!
  288. This is because you pass in the maximum buffer size and get back the
  289. actual length that was used.
  290. </p>
  291. <p>
  292. In C you'd pass in the address of a local variable
  293. (<tt>&amp;buflen</tt>). But since there's no address-of operator in
  294. Lua, we'll just pass in a one-element array. Conveniently it can be
  295. initialized with the maximum buffer size in one step. Calling the
  296. actual <tt>zlib.compress2</tt> function is then straightforward.
  297. </p>
  298. <p>
  299. <span class="mark">&#9316;</span> We want to return the
  300. compressed data as a Lua string, so we'll use <tt>ffi.string()</tt>.
  301. It needs a pointer to the start of the data and the actual length. The
  302. length has been returned in the <tt>buflen</tt> array, so we'll just
  303. get it from there.
  304. </p>
  305. <p style="font-size: 8pt;">
  306. Note that since the function returns now, the <tt>buf</tt> and
  307. <tt>buflen</tt> variables will eventually be garbage collected. This
  308. is fine, because <tt>ffi.string()</tt> has copied the contents to a
  309. newly created (interned) Lua string. If you plan to call this function
  310. lots of times, consider reusing the buffers and/or handing back the
  311. results in buffers instead of strings. This will reduce the overhead
  312. for garbage collection and string interning.
  313. </p>
  314. <p>
  315. <span class="mark">&#9317;</span> The <tt>uncompress</tt>
  316. functions does the exact opposite of the <tt>compress</tt> function.
  317. The compressed data doesn't include the size of the original string,
  318. so this needs to be passed in. Otherwise no surprises here.
  319. </p>
  320. <p>
  321. <span class="mark">&#9318;</span> The code, that makes use
  322. of the functions we just defined, is just plain Lua code. It doesn't
  323. need to know anything about the LuaJIT FFI &mdash; the convenience
  324. wrapper functions completely hide it.
  325. </p>
  326. <p>
  327. One major advantage of the LuaJIT FFI is that you are now able to
  328. write those wrappers <em>in Lua</em>. And at a fraction of the time it
  329. would cost you to create an extra C&nbsp;module using the Lua/C API.
  330. Many of the simpler C&nbsp;functions can probably be used directly
  331. from your Lua code, without any wrappers.
  332. </p>
  333. <p style="font-size: 8pt;">
  334. Side note: the zlib API uses the <tt>long</tt> type for passing
  335. lengths and sizes around. But all those zlib functions actually only
  336. deal with 32&nbsp;bit values. This is an unfortunate choice for a
  337. public API, but may be explained by zlib's history &mdash; we'll just
  338. have to deal with it.
  339. </p>
  340. <p style="font-size: 8pt;">
  341. First, you should know that a <tt>long</tt> is a 64&nbsp;bit type e.g.
  342. on POSIX/x64 systems, but a 32&nbsp;bit type on Windows/x64 and on
  343. 32&nbsp;bit systems. Thus a <tt>long</tt> result can be either a plain
  344. Lua number or a boxed 64&nbsp;bit integer cdata object, depending on
  345. the target system.
  346. </p>
  347. <p style="font-size: 8pt;">
  348. Ok, so the <tt>ffi.*</tt> functions generally accept cdata objects
  349. wherever you'd want to use a number. That's why we get a away with
  350. passing <tt>n</tt> to <tt>ffi.string()</tt> above. But other Lua
  351. library functions or modules don't know how to deal with this. So for
  352. maximum portability one needs to use <tt>tonumber()</tt> on returned
  353. <tt>long</tt> results before passing them on. Otherwise the
  354. application might work on some systems, but would fail in a POSIX/x64
  355. environment.
  356. </p>
  357. <h2 id="metatype">Defining Metamethods for a C&nbsp;Type</h2>
  358. <p>
  359. The following code explains how to define metamethods for a C type.
  360. We define a simple point type and add some operations to it:
  361. </p>
  362. <pre class="code mark">
  363. <span class="codemark">&nbsp;
  364. &#9312;
  365. &#9313;
  366. &#9314;
  367. &#9315;
  368. &#9316;
  369. &#9317;</span>local ffi = require("ffi")
  370. ffi.cdef[[
  371. <span style="color:#00a000;">typedef struct { double x, y; } point_t;</span>
  372. ]]
  373. local point
  374. local mt = {
  375. __add = function(a, b) return point(a.x+b.x, a.y+b.y) end,
  376. __len = function(a) return math.sqrt(a.x*a.x + a.y*a.y) end,
  377. __index = {
  378. area = function(a) return a.x*a.x + a.y*a.y end,
  379. },
  380. }
  381. point = ffi.metatype("point_t", mt)
  382. local a = point(3, 4)
  383. print(a.x, a.y) --> 3 4
  384. print(#a) --> 5
  385. print(a:area()) --> 25
  386. local b = a + point(0.5, 8)
  387. print(#b) --> 12.5
  388. </pre>
  389. <p>
  390. Here's the step-by-step explanation:
  391. </p>
  392. <p>
  393. <span class="mark">&#9312;</span> This defines the C&nbsp;type for a
  394. two-dimensional point object.
  395. </p>
  396. <p>
  397. <span class="mark">&#9313;</span> We have to declare the variable
  398. holding the point constructor first, because it's used inside of a
  399. metamethod.
  400. </p>
  401. <p>
  402. <span class="mark">&#9314;</span> Let's define an <tt>__add</tt>
  403. metamethod which adds the coordinates of two points and creates a new
  404. point object. For simplicity, this function assumes that both arguments
  405. are points. But it could be any mix of objects, if at least one operand
  406. is of the required type (e.g. adding a point plus a number or vice
  407. versa). Our <tt>__len</tt> metamethod returns the distance of a point to
  408. the origin.
  409. </p>
  410. <p>
  411. <span class="mark">&#9315;</span> If we run out of operators, we can
  412. define named methods, too. Here the <tt>__index</tt> table defines an
  413. <tt>area</tt> function. For custom indexing needs, one might want to
  414. define <tt>__index</tt> and <tt>__newindex</tt> <em>functions</em> instead.
  415. </p>
  416. <p>
  417. <span class="mark">&#9316;</span> This associates the metamethods with
  418. our C&nbsp;type. This only needs to be done once. For convenience, a
  419. constructor is returned by
  420. <a href="ext_ffi_api.html#ffi_metatype"><tt>ffi.metatype()</tt></a>.
  421. We're not required to use it, though. The original C&nbsp;type can still
  422. be used e.g. to create an array of points. The metamethods automatically
  423. apply to any and all uses of this type.
  424. </p>
  425. <p>
  426. Please note that the association with a metatable is permanent and
  427. <b>the metatable must not be modified afterwards!</b> Ditto for the
  428. <tt>__index</tt> table.
  429. </p>
  430. <p>
  431. <span class="mark">&#9317;</span> Here are some simple usage examples
  432. for the point type and their expected results. The pre-defined
  433. operations (such as <tt>a.x</tt>) can be freely mixed with the newly
  434. defined metamethods. Note that <tt>area</tt> is a method and must be
  435. called with the Lua syntax for methods: <tt>a:area()</tt>, not
  436. <tt>a.area()</tt>.
  437. </p>
  438. <p>
  439. The C&nbsp;type metamethod mechanism is most useful when used in
  440. conjunction with C&nbsp;libraries that are written in an object-oriented
  441. style. Creators return a pointer to a new instance and methods take an
  442. instance pointer as the first argument. Sometimes you can just point
  443. <tt>__index</tt> to the library namespace and <tt>__gc</tt> to the
  444. destructor and you're done. But often enough you'll want to add
  445. convenience wrappers, e.g. to return actual Lua strings or when
  446. returning multiple values.
  447. </p>
  448. <p>
  449. Some C libraries only declare instance pointers as an opaque
  450. <tt>void&nbsp;*</tt> type. In this case you can use a fake type for all
  451. declarations, e.g. a pointer to a named (incomplete) struct will do:
  452. <tt>typedef struct foo_type *foo_handle</tt>. The C&nbsp;side doesn't
  453. know what you declare with the LuaJIT FFI, but as long as the underlying
  454. types are compatible, everything still works.
  455. </p>
  456. <h2 id="idioms">Translating C&nbsp;Idioms</h2>
  457. <p>
  458. Here's a list of common C&nbsp;idioms and their translation to the
  459. LuaJIT FFI:
  460. </p>
  461. <table class="idiomtable">
  462. <tr class="idiomhead">
  463. <td class="idiomdesc">Idiom</td>
  464. <td class="idiomc">C&nbsp;code</td>
  465. <td class="idiomlua">Lua code</td>
  466. </tr>
  467. <tr class="odd separate">
  468. <td class="idiomdesc">Pointer dereference<br><tt>int *p;</tt></td><td class="idiomc"><tt>x = *p;<br>*p = y;</tt></td><td class="idiomlua"><tt>x = <b>p[0]</b><br><b>p[0]</b> = y</tt></td></tr>
  469. <tr class="even">
  470. <td class="idiomdesc">Pointer indexing<br><tt>int i, *p;</tt></td><td class="idiomc"><tt>x = p[i];<br>p[i+1] = y;</tt></td><td class="idiomlua"><tt>x = p[i]<br>p[i+1] = y</tt></td></tr>
  471. <tr class="odd">
  472. <td class="idiomdesc">Array indexing<br><tt>int i, a[];</tt></td><td class="idiomc"><tt>x = a[i];<br>a[i+1] = y;</tt></td><td class="idiomlua"><tt>x = a[i]<br>a[i+1] = y</tt></td></tr>
  473. <tr class="even separate">
  474. <td class="idiomdesc"><tt>struct</tt>/<tt>union</tt> dereference<br><tt>struct foo s;</tt></td><td class="idiomc"><tt>x = s.field;<br>s.field = y;</tt></td><td class="idiomlua"><tt>x = s.field<br>s.field = y</tt></td></tr>
  475. <tr class="odd">
  476. <td class="idiomdesc"><tt>struct</tt>/<tt>union</tt> pointer deref.<br><tt>struct foo *sp;</tt></td><td class="idiomc"><tt>x = sp->field;<br>sp->field = y;</tt></td><td class="idiomlua"><tt>x = <b>s.field</b><br><b>s.field</b> = y</tt></td></tr>
  477. <tr class="even separate">
  478. <td class="idiomdesc">Pointer arithmetic<br><tt>int i, *p;</tt></td><td class="idiomc"><tt>x = p + i;<br>y = p - i;</tt></td><td class="idiomlua"><tt>x = p + i<br>y = p - i</tt></td></tr>
  479. <tr class="odd">
  480. <td class="idiomdesc">Pointer difference<br><tt>int *p1, *p2;</tt></td><td class="idiomc"><tt>x = p1 - p2;</tt></td><td class="idiomlua"><tt>x = p1 - p2</tt></td></tr>
  481. <tr class="even">
  482. <td class="idiomdesc">Array element pointer<br><tt>int i, a[];</tt></td><td class="idiomc"><tt>x = &amp;a[i];</tt></td><td class="idiomlua"><tt>x = <b>a+i</b></tt></td></tr>
  483. <tr class="odd">
  484. <td class="idiomdesc">Cast pointer to address<br><tt>int *p;</tt></td><td class="idiomc"><tt>x = (intptr_t)p;</tt></td><td class="idiomlua"><tt>x = <b>tonumber(<br>&nbsp;ffi.cast("intptr_t",<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;p))</b></tt></td></tr>
  485. <tr class="even separate">
  486. <td class="idiomdesc">Functions with outargs<br><tt>void foo(int *inoutlen);</tt></td><td class="idiomc"><tt>int len = x;<br>foo(&amp;len);<br>y = len;</tt></td><td class="idiomlua"><tt><b>local len =<br>&nbsp;&nbsp;ffi.new("int[1]", x)<br>foo(len)<br>y = len[0]</b></tt></td></tr>
  487. <tr class="odd">
  488. <td class="idiomdesc"><a href="ext_ffi_semantics.html#convert_vararg">Vararg conversions</a><br><tt>int printf(char *fmt, ...);</tt></td><td class="idiomc"><tt>printf("%g", 1.0);<br>printf("%d", 1);<br>&nbsp;</tt></td><td class="idiomlua"><tt>printf("%g", 1);<br>printf("%d",<br>&nbsp;&nbsp;<b>ffi.new("int", 1)</b>)</tt></td></tr>
  489. </table>
  490. <h2 id="cache">To Cache or Not to Cache</h2>
  491. <p>
  492. It's a common Lua idiom to cache library functions in local variables
  493. or upvalues, e.g.:
  494. </p>
  495. <pre class="code">
  496. local byte, char = string.byte, string.char
  497. local function foo(x)
  498. return char(byte(x)+1)
  499. end
  500. </pre>
  501. <p>
  502. This replaces several hash-table lookups with a (faster) direct use of
  503. a local or an upvalue. This is less important with LuaJIT, since the
  504. JIT compiler optimizes hash-table lookups a lot and is even able to
  505. hoist most of them out of the inner loops. It can't eliminate
  506. <em>all</em> of them, though, and it saves some typing for often-used
  507. functions. So there's still a place for this, even with LuaJIT.
  508. </p>
  509. <p>
  510. The situation is a bit different with C&nbsp;function calls via the
  511. FFI library. The JIT compiler has special logic to eliminate <em>all
  512. of the lookup overhead</em> for functions resolved from a
  513. <a href="ext_ffi_semantics.html#clib">C&nbsp;library namespace</a>!
  514. Thus it's not helpful and actually counter-productive to cache
  515. individual C&nbsp;functions like this:
  516. </p>
  517. <pre class="code">
  518. local <b>funca</b>, <b>funcb</b> = ffi.C.funcb, ffi.C.funcb -- <span style="color:#c00000;">Not helpful!</span>
  519. local function foo(x, n)
  520. for i=1,n do <b>funcb</b>(<b>funca</b>(x, i), 1) end
  521. end
  522. </pre>
  523. <p>
  524. This turns them into indirect calls and generates bigger and slower
  525. machine code. Instead you'll want to cache the namespace itself and
  526. rely on the JIT compiler to eliminate the lookups:
  527. </p>
  528. <pre class="code">
  529. local <b>C</b> = ffi.C -- <span style="color:#00a000;">Instead use this!</span>
  530. local function foo(x, n)
  531. for i=1,n do <b>C.funcb</b>(<b>C.funca</b>(x, i), 1) end
  532. end
  533. </pre>
  534. <p>
  535. This generates both shorter and faster code. So <b>don't cache
  536. C&nbsp;functions</b>, but <b>do</b> cache namespaces! Most often the
  537. namespace is already in a local variable at an outer scope, e.g. from
  538. <tt>local&nbsp;lib&nbsp;=&nbsp;ffi.load(...)</tt>. Note that copying
  539. it to a local variable in the function scope is unnecessary.
  540. </p>
  541. <br class="flush">
  542. </div>
  543. <div id="foot">
  544. <hr class="hide">
  545. Copyright &copy; 2005-2012 Mike Pall
  546. <span class="noprint">
  547. &middot;
  548. <a href="contact.html">Contact</a>
  549. </span>
  550. </div>
  551. </body>
  552. </html>