Jonas Maebe 4b8643f73b * removed some clashes between ifdefs and regular tokens 17 년 전
..
Answers 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
ChangeLog 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
HOWTO-FPC.txt de08321368 + short instructions on how to use it 17 년 전
LICENSE.txt 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
Makefile 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
READ.ME 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
README.1st 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
READ_ME 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
TechnReport 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
add_run.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
add_run.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
aiso.bdy 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
aiso.spc 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
algollike.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
algollike.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
clang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
compare.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
compare.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
debug.par 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
error.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
error.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
hash.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
hash.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
idf.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
idf.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
javalang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
lang.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
language.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
lex.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
lex.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
lisplang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
m2lang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
miralang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
options.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
options.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
pascallang.l 4b8643f73b * removed some clashes between ifdefs and regular tokens 17 년 전
pass1.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
pass1.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
pass2.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
pass2.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
pass3.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
pass3.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
percentages.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
percentages.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
runs.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
runs.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
settings.par 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
sim.1 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
sim.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
sim.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
sim.html 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
sim.txt 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
sortlist.bdy 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
sortlist.spc 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
stream.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
stream.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
sysidf.mk 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
sysidf.msdos 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
sysidf.unix 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
system.par 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
text.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
text.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
textlang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
token.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
token.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
tokenarray.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전
tokenarray.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 년 전

README.1st

This is SIM, Software and text similarity tester, most recent revision
(2.19, 20050220)
by Dick Grune, Vrije Universiteit, Amsterdam, the Netherlands ([email protected]).

SIM tests lexical similarity in texts in C, Java, Pascal, Modula-2, Lisp,
Miranda and natural language. It can be used

- to detect potentially duplicated code fragments in large software projects,
- to detect plagiarism in software and text-based projects, educational and
otherwise.

The program is fast:
the UNIX version on a Sun ULTRA does about 50000 tokens/sec,
the DOS version on a Pentium 166 does about 25000 tokens/sec.

SIM is available for UNIX (in source code) and MSDOS (32-bit executables).

UNIX:
To obtain the files, do:
sh sim_2_21.shar
This unpacks the sources, the Makefile, sim.1 and READ_ME.
For installation notes and other info then see READ_ME.

MSDOS:
To obtain the files, do:
[pk]unzip SIM_2_21.zip
This unpacks the executables, SIM.DOC and READ.ME.
For other info then see READ.ME.

Changes from Release 2.19:
Various changes necessitated by Linux flex being different

Changes from Release 2.16:
Various updates and adjustments in the code and the installation
procedure.

Changes from Release 2.13:
Percentage reporting feature added.

Changes from Release 2.12:
Miranda checker added.

Changes from Release 2.9:
Java checker added.
The C checker 'sim' was renamed to 'sim_c', for uniformity.
Converted the sources to ANSI C.
All versions now report non_ASCI characters in the input.

Changes from Release 2.8:
DOS versions can now compare very large files (>400000 tokens)

Changes from Release 1.21, as posted in comp.sources.unix (1987):
Ported to MSDOS
Significant speed improvements
New options: -e, -S and / , to compare files group-wise
New option: -F , to require function names to match exactly
Lisp version added
Miscellaneous improvements


Dick Grune
Vrije Universiteit
de Boelelaan 1081
1081 HV Amsterdam
the Netherlands
email: [email protected]
ftp://ftp.cs.vu.nl/pub/dick
http://www.cs.vu.nl/~dick