Jonas Maebe 4b8643f73b * removed some clashes between ifdefs and regular tokens 17 years ago
Answers 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
ChangeLog 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
HOWTO-FPC.txt de08321368 + short instructions on how to use it 17 years ago
LICENSE.txt 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
Makefile 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
READ.ME 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
README.1st 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
READ_ME 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
TechnReport 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
add_run.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
add_run.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
aiso.bdy 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
aiso.spc 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
algollike.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
algollike.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
clang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
compare.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
compare.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
debug.par 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
error.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
error.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
hash.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
hash.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
idf.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
idf.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
javalang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
lang.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
language.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
lex.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
lex.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
lisplang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
m2lang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
miralang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
options.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
options.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
pascallang.l 4b8643f73b * removed some clashes between ifdefs and regular tokens 17 years ago
pass1.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
pass1.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
pass2.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
pass2.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
pass3.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
pass3.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
percentages.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
percentages.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
runs.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
runs.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
settings.par 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sim.1 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sim.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sim.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sim.html 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sim.txt 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sortlist.bdy 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sortlist.spc 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
stream.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
stream.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sysidf.mk 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sysidf.msdos 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sysidf.unix 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
system.par 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
text.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
text.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
textlang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
token.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
token.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
tokenarray.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
tokenarray.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago

README.1st

This is SIM, a software and text similarity tester, most recent revision
(2.19, 20050220)
by Dick Grune, Vrije Universiteit, Amsterdam, the Netherlands ([email protected]).

SIM tests lexical similarity in texts in C, Java, Pascal, Modula-2, Lisp,
Miranda and natural language. It can be used

- to detect potentially duplicated code fragments in large software projects,
- to detect plagiarism in software and text-based projects, educational and
otherwise.
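
To make "lexical similarity" concrete, here is a minimal sketch in C. It is
not SIM's actual algorithm or code; the function names, the token scheme
(every identifier becomes 'I', every number becomes 'N') and the fixed
MAX_TOKENS limit are assumptions made for illustration only. It reduces two
fragments to token sequences and returns the length of the longest run of
tokens they share.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>

#define MAX_TOKENS 256  /* illustrative limit; longer input is truncated */

/* Reduce text to a token sequence: every identifier becomes 'I', every
   number becomes 'N', any other non-blank character stands for itself.
   Returns the number of tokens stored (at most max). */
static size_t tokenize(const char *text, char *tokens, size_t max)
{
    size_t n = 0;

    while (*text != '\0' && n < max) {
        unsigned char c = (unsigned char)*text;

        if (isspace(c)) {
            text++;
        } else if (isalpha(c) || c == '_') {
            while (isalnum((unsigned char)*text) || *text == '_')
                text++;
            tokens[n++] = 'I';
        } else if (isdigit(c)) {
            while (isdigit((unsigned char)*text))
                text++;
            tokens[n++] = 'N';
        } else {
            tokens[n++] = (char)c;
            text++;
        }
    }
    return n;
}

/* Length, in tokens, of the longest contiguous token run shared by a and b. */
size_t longest_common_run(const char *a, const char *b)
{
    char ta[MAX_TOKENS], tb[MAX_TOKENS];
    size_t run[MAX_TOKENS + 1] = {0};  /* run[j]: match length ending at tb[j-1] */
    size_t na, nb, best = 0;

    assert(a != NULL && b != NULL);
    na = tokenize(a, ta, MAX_TOKENS);
    nb = tokenize(b, tb, MAX_TOKENS);

    for (size_t i = 0; i < na; i++) {
        /* Walk j downwards so run[j-1] still holds the previous row. */
        for (size_t j = nb; j >= 1; j--) {
            run[j] = (ta[i] == tb[j - 1]) ? run[j - 1] + 1 : 0;
            if (run[j] > best)
                best = run[j];
        }
    }
    return best;
}
```

Because identifiers are folded into a single token, the fragments
"a = b + c;" and "total = x + y;" match over all six of their tokens even
though no name is shared, which is why renaming variables alone does not hide
a copied fragment from a lexical comparison of this kind.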

The program is fast:
the UNIX version on a Sun ULTRA does about 50000 tokens/sec,
the DOS version on a Pentium 166 does about 25000 tokens/sec.

SIM is available for UNIX (in source code) and MSDOS (32-bit executables).

UNIX:
To obtain the files, do:
sh sim_2_21.shar
This unpacks the sources, the Makefile, sim.1 and READ_ME.
For installation notes and other information, see READ_ME.

MSDOS:
To obtain the files, do:
[pk]unzip SIM_2_21.zip
This unpacks the executables, SIM.DOC and READ.ME.
For other information, see READ.ME.

Changes from Release 2.19:
Various changes necessitated by Linux flex being different.

Changes from Release 2.16:
Various updates and adjustments in the code and the installation
procedure.

Changes from Release 2.13:
Percentage reporting feature added.

Changes from Release 2.12:
Miranda checker added.

Changes from Release 2.9:
Java checker added.
The C checker 'sim' was renamed to 'sim_c', for uniformity.
Converted the sources to ANSI C.
All versions now report non-ASCII characters in the input.

Changes from Release 2.8:
DOS versions can now compare very large files (>400000 tokens).

Changes from Release 1.21, as posted in comp.sources.unix (1987):
Ported to MSDOS
Significant speed improvements
New options: -e, -S and /, to compare files group-wise
New option: -F, to require function names to match exactly
Lisp version added
Miscellaneous improvements


Dick Grune
Vrije Universiteit
de Boelelaan 1081
1081 HV Amsterdam
the Netherlands
email: [email protected]
ftp://ftp.cs.vu.nl/pub/dick
http://www.cs.vu.nl/~dick