* MCS: The Ximian C# compiler

MCS is currently able to compile itself and many more C#
programs (there is a test suite included that you can use).
It is routinely used to compile Mono, roughly half a million
lines of C# code.

We are in feature completion mode right now. There are still
a couple of areas that are not covered by the Mono compiler
(security attributes, for example), but they are very few at this point.
You can also browse the MCS <a href="http://bugzilla.ximian.com/buglist.cgi?product=Mono%2FMCS&bug_status=NEW&bug_status=ASSIGNED&bug_status=REOPENED&email1=&emailtype1=substring&emailassigned_to1=1&email2=&emailtype2=substring&emailreporter2=1&changedin=&chfieldfrom=&chfieldto=Now&chfieldvalue=&short_desc=&short_desc_type=substring&long_desc=&long_desc_type=substring&bug_file_loc=&bug_file_loc_type=substring&keywords=&keywords_type=anywords&op_sys_details=&op_sys_details_type=substring&version_details=&version_details_type=substring&cmdtype=doit&newqueryname=&order=Reuse+same+sort+as+last+time&form_name=query">bugs</a> from Bugzilla.

A test suite is maintained to track the progress of
the compiler, and various programs are routinely compiled and
run.

** Slides

Slides for the Mono C# Compiler presentation at .NET ONE are
available <a
href="http://primates.ximian.com/~miguel/slides-europe-nov-2002/Mono_C_Sharp_Overview_1007.sxi">here</a>
in StarOffice format.

** Obtaining MCS

The Mono C# compiler is part of the `mcs' module in the Mono CVS.
You can get it from our <a href="anoncvs.html">Anonymous CVS</a> server,
or you can get a nightly snapshot from the <a href="download.html">download page</a>.

** Running MCS

MCS is written in C# and makes heavy use of the .NET APIs. MCS runs
on Linux with the Mono runtime and on Windows with both the
.NET runtime and the Mono runtime.

** Reporting Bugs in MCS

When you report a bug, try to provide a small test case that
shows the error so we can include it as part of the Mono C# regression
test suite.

If the bug is an error or a warning that we do not flag, write
a sample program that illustrates the problem and call it `csXXXX.cs',
where XXXX is the code number that the Microsoft C# compiler uses
for that error or warning. That way we can also do regression tests
on the invalid input.

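As an illustration, a test case for a missing error might look like the sketch below. The choice of CS0103 (the Microsoft compiler's code for using a name that has not been declared), the file name `cs0103.cs' and the header comment are assumptions made for this example, not a layout required by the test suite. The program is deliberately invalid, so a correct compiler should reject it.

```csharp
// cs0103.cs: expected to fail with error CS0103,
// because 'undefined_variable' is never declared.
class X {
	static void Main ()
	{
		// The compiler should flag the undeclared name on this line.
		System.Console.WriteLine (undefined_variable);
	}
}
```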
** Phases of the compiler

The compiler has a number of phases:

<ul>
* Lexical analyzer: hand-coded lexical analyzer that
provides tokens to the parser.

* The Parser: the parser is implemented using Jay (a
Berkeley Yacc port to Java that I ported to C#).
The parser does minimal work and syntax checking,
and only constructs a parse tree.

Each language element gets its own class. The code
convention is to use a capitalized name for the
language element. So a C# class and its associated
information is kept in a "Class" class, a "struct"
in a "Struct" class and so on. Statements derive
from the "Statement" class, and Expressions from the
"Expr" class.

* Parent class resolution: before the actual code
generation, we need to resolve the parents and
interfaces for interface, class and struct
definitions.

* Semantic analysis: since C# cannot resolve in a
top-down pass what identifiers actually mean, we
have to postpone this decision until the above steps
are finished.

* Code generation: The code generation is done through
the System.Reflection.Emit API.
</ul>

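To make the last two ideas concrete, here is a minimal sketch of a tree in which each language element is its own class and each node emits CIL through System.Reflection.Emit. It is not MCS code: the `EmitSketch' namespace and the `IntConstant', `Addition' and `Driver' names are hypothetical, and it uses DynamicMethod for brevity where a compiler that writes assemblies to disk would use the assembly, module and type builders instead.

```csharp
using System;
using System.Reflection.Emit;

namespace EmitSketch {
	// Hypothetical base class: every expression node derives from Expr.
	public abstract class Expr {
		// Emits CIL that leaves the value of the expression on the stack.
		public abstract void Emit (ILGenerator ig);
	}

	public class IntConstant : Expr {
		readonly int value;

		public IntConstant (int value) { this.value = value; }

		public override void Emit (ILGenerator ig)
		{
			ig.Emit (OpCodes.Ldc_I4, value);
		}
	}

	public class Addition : Expr {
		readonly Expr left, right;

		public Addition (Expr left, Expr right)
		{
			this.left = left;
			this.right = right;
		}

		public override void Emit (ILGenerator ig)
		{
			// Push both operands, then add them.
			left.Emit (ig);
			right.Emit (ig);
			ig.Emit (OpCodes.Add);
		}
	}

	public static class Driver {
		// Wraps the tree in a parameterless method returning int and runs it.
		public static int Evaluate (Expr tree)
		{
			DynamicMethod method = new DynamicMethod ("Eval", typeof (int), Type.EmptyTypes);
			ILGenerator ig = method.GetILGenerator ();
			tree.Emit (ig);
			ig.Emit (OpCodes.Ret);
			Func<int> eval = (Func<int>) method.CreateDelegate (typeof (Func<int>));
			return eval ();
		}

		public static void Main ()
		{
			Expr tree = new Addition (new IntConstant (2), new IntConstant (40));
			Console.WriteLine (Evaluate (tree)); // prints 42
		}
	}
}
```

Keeping the emit logic inside each node class means that adding a new language element is mostly a matter of adding one more class.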
** CIL Optimizations

The compiler performs a number of simple optimizations on its input:
it does constant folding (this is required by the C# language spec) and
it can perform dead code elimination.

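Constant folding can be sketched on a tiny expression tree as follows. This is an illustration of the technique only, not the MCS implementation: the `FoldingSketch' namespace and its class names are hypothetical, only integer addition is handled, and overflow checking is left out.

```csharp
using System;

namespace FoldingSketch {
	public abstract class Expr {
		// By default an expression folds to itself.
		public virtual Expr Fold () { return this; }
	}

	public class IntConstant : Expr {
		public readonly int Value;

		public IntConstant (int value) { Value = value; }

		public override string ToString () { return Value.ToString (); }
	}

	public class Variable : Expr {
		public readonly string Name;

		public Variable (string name) { Name = name; }

		public override string ToString () { return Name; }
	}

	public class Addition : Expr {
		public readonly Expr Left, Right;

		public Addition (Expr left, Expr right) { Left = left; Right = right; }

		public override Expr Fold ()
		{
			// Fold the operands first, then combine them if both are constants.
			Expr left = Left.Fold ();
			Expr right = Right.Fold ();
			IntConstant lc = left as IntConstant;
			IntConstant rc = right as IntConstant;
			if (lc != null && rc != null)
				return new IntConstant (lc.Value + rc.Value);
			return new Addition (left, right);
		}

		public override string ToString ()
		{
			return "(" + Left + " + " + Right + ")";
		}
	}

	public static class Demo {
		public static void Main ()
		{
			// Represents (1 + 2) + x.
			Expr tree = new Addition (
				new Addition (new IntConstant (1), new IntConstant (2)),
				new Variable ("x"));
			Console.WriteLine (tree);         // prints ((1 + 2) + x)
			Console.WriteLine (tree.Fold ()); // prints (3 + x)
		}
	}
}
```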
Other more interesting optimizations like hoisting are not possible
at this point, since the compiler does not currently
generate an intermediate representation that is suitable for
basic block computation.

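For readers unfamiliar with the term, the sketch below shows by hand what hoisting a loop-invariant computation amounts to. The `HoistingSketch' namespace and the method names are hypothetical; the transformation is written out manually here and is not something MCS does.

```csharp
using System;

namespace HoistingSketch {
	public static class Loops {
		// Before: 'scale * offset' is recomputed on every iteration,
		// although neither operand changes inside the loop.
		public static int SumNaive (int[] data, int scale, int offset)
		{
			int total = 0;
			for (int i = 0; i < data.Length; i++)
				total += data [i] + scale * offset;
			return total;
		}

		// After: the loop-invariant product is computed once, outside the loop.
		public static int SumHoisted (int[] data, int scale, int offset)
		{
			int invariant = scale * offset;
			int total = 0;
			for (int i = 0; i < data.Length; i++)
				total += data [i] + invariant;
			return total;
		}

		public static void Main ()
		{
			int[] data = { 1, 2, 3 };
			Console.WriteLine (SumNaive (data, 4, 5));   // prints 66
			Console.WriteLine (SumHoisted (data, 4, 5)); // prints 66
		}
	}
}
```

An optimizer needs to know where the loop begins and ends and which values change inside it before it can do this safely, which is why the basic block information matters.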
Adding an intermediate layer to the compiler to enable basic block
computation should be a simple task, but we
are considering having a generic CIL optimizer. Since all the
information that is required to perform basic block-based
optimizations is available at the CIL level, we might just skip
this step altogether and have just a generic IL optimizer that
would perform hoisting on arbitrary CIL programs, not only
those produced by MCS.

If this tool is further expanded to perform constant folding
(not needed for our C# compiler, as it is already in there)
and dead code elimination, other compiler authors might be
able to use this generic CIL optimizer in their projects,
reducing the time they need to develop a production compiler.

** History

MCS was able to parse itself in April 2001. MCS compiled itself
for the first time on December 28, 2001. MCS became self-hosting
on January 3rd, 2002.

The Mono Runtime and the Mono execution engine were able to make
our compiler self-hosting on March 12, 2002.

** Questions and Answers

Q: Why not write a C# front-end for GCC?

A: I wanted to learn about C#, and this was an exercise in this
task. The resulting compiler is highly object-oriented, which has
led to a very nice, easy to follow and simple implementation of
the compiler.

I found that the design of this compiler is very similar to
Guavac's implementation.

Targeting the CIL/MSIL byte codes would require re-architecting
GCC, as GCC is mostly designed to be used for register machines.

The GCC Java engine that generates Java byte codes cheats: it does
not use the GCC backend; it has a special backend just for Java, so
you cannot really generate Java bytecodes from the other languages
supported by GCC.

Q: If your C# compiler is written in C#, how do you plan on getting
this working in a non-Microsoft environment?

A: We will do this through an implementation of the CLI Virtual
Execution System for Unix (our JIT engine).

Our JIT engine is working for the purposes of using the compiler.
The supporting class libraries are being worked on to fully support
the compiler.

Q: Do you use Bison?

A: No, currently I am using Jay, which is a port of Berkeley Yacc to
Java that I later ported to C#. This means that error recovery is
not as nice as I would like, and for some reason error
productions are not being caught.

In the future I want to port one of the Bison/Java ports to C# for
the parser.

Q: Should someone work on a GCC front-end to C#?

A: I would love it if someone did, and we would love to help anyone who
takes on that task, but we do not have the time or expertise to
build a C# compiler with the GCC engine. I personally find it a lot
more fun to work in C# on a C# compiler, which has an intrinsic
beauty.

We can provide help and assistance to anyone who would like to work
on this task.

Q: Should someone make a GCC backend that will generate CIL images?

A: I would love to see a backend to GCC that generates CIL images. It
would provide a ton of free compilers that would generate CIL
code. This is something that people would want to look into
anyway for Windows interoperation in the future.

Again, we would love to provide help and assistance to anyone
interested in working on such a project.

Q: What about making a front-end to GCC that takes CIL images and
generates native code?

A: I would love to see this, especially since GCC supports this same
feature for Java Byte Codes. You could use the metadata library
from Mono to read the byte codes (i.e., this would be your
"front-end") and generate the trees that get passed to the
optimizer.

Ideally our implementation of the CLI will be available as a shared
library that could be linked with your application as its runtime
support.

Again, we would love to provide help and assistance to anyone
interested in working on such a project.

Q: But would this work around the GPL in the GCC compiler and allow
people to work on non-free front-ends?

A: People can already do this by targeting the JVM byte codes (there
are about 130 compilers for various languages that target the JVM).

Q: Why are you writing a JIT engine instead of a front-end to GCC?

A: The JIT engine and runtime engine will be able to execute CIL
executables generated on Windows.

You might also want to look at the <a href="faq.html#gcc">GCC</a>
section in the main FAQ.