
This is SIM, Software and text similarity tester, most recent revision
(2.19, 20050220)
by Dick Grune, Vrije Universiteit, Amsterdam, the Netherlands ([email protected]).

SIM tests lexical similarity in texts in C, Java, Pascal, Modula-2, Lisp,
Miranda and natural language. It can be used
- to detect potentially duplicated code fragments in large software projects,
- to detect plagiarism in software and text-based projects, educational and
  otherwise.

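To give an idea of what "lexical similarity" can mean, here is a minimal
sketch in Python. It is NOT the algorithm used by SIM; the tokenizer, the
function names tokenize and longest_common_run and the two sample fragments
are all assumptions made up for this illustration. It splits two texts into
tokens and finds the longest run of tokens they have in common, so that
differences in layout (spaces, line breaks) do not matter.

```python
import re


def tokenize(text):
    """Split text into identifiers, numbers and single punctuation characters.

    Hypothetical tokenizer for illustration only; whitespace is discarded.
    """
    return re.findall(r"[A-Za-z_]\w*|\d+|\S", text)


def longest_common_run(a, b):
    """Return (length, start_in_a, start_in_b) of the longest run of
    consecutive tokens that occurs in both token lists."""
    best = (0, 0, 0)
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                # Extend the run that ended at the previous token pair.
                cur[j] = prev[j - 1] + 1
                if cur[j] > best[0]:
                    best = (cur[j], i - cur[j], j - cur[j])
        prev = cur
    return best


# Two made-up C fragments that share a loop header.
first = tokenize("for (i = 0; i < n; i++) sum += a[i];")
second = tokenize("total = 0; for (i = 0; i < n; i++) total += a[i];")

length, start_a, start_b = longest_common_run(first, second)
print(length, " ".join(first[start_a:start_a + length]))
```

This simple version takes time proportional to the product of the two token
counts, so it is only suitable for small inputs; a real similarity tester
needs something considerably faster.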
The program is fast:
    the UNIX version on a Sun ULTRA does about 50000 tokens/sec,
    the DOS version on a Pentium 166 does about 25000 tokens/sec.

SIM is available for UNIX (in source code) and MSDOS (32-bit executables).

UNIX:
    To obtain the files, do:
        sh sim_2_21.shar
    This unpacks the sources, the Makefile, sim.1 and READ_ME.
    For installation notes and other information, see READ_ME.

MSDOS:
    To obtain the files, do:
        [pk]unzip SIM_2_21.zip
    This unpacks the executables, SIM.DOC and READ.ME.
    For other information, see READ.ME.

Changes from Release 2.19:
    Various changes necessitated by Linux flex being different.

Changes from Release 2.16:
    Various updates and adjustments in the code and the installation
    procedure.

Changes from Release 2.13:
    Percentage reporting feature added.

Changes from Release 2.12:
    Miranda checker added.

Changes from Release 2.9:
    Java checker added.
    The C checker 'sim' was renamed to 'sim_c', for uniformity.
    Converted the sources to ANSI C.
    All versions now report non-ASCII characters in the input.

Changes from Release 2.8:
    DOS versions can now compare very large files (>400000 tokens).

Changes from Release 1.21, as posted in comp.sources.unix (1987):
    Ported to MSDOS
    Significant speed improvements
    New options: -e, -S and /, to compare files group-wise
    New option: -F, to require function names to match exactly
    Lisp version added
    Miscellaneous improvements

                Dick Grune
                Vrije Universiteit
                de Boelelaan 1081
                1081 HV  Amsterdam
                the Netherlands
                email: [email protected]
                ftp://ftp.cs.vu.nl/pub/dick
                http://www.cs.vu.nl/~dick