Jonas Maebe 4b8643f73b * removed some clashes between ifdefs and regular tokens %!s(int64=17) %!d(string=hai) anos
..
Answers 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
ChangeLog 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
HOWTO-FPC.txt de08321368 + short instructions on how to use it %!s(int64=17) %!d(string=hai) anos
LICENSE.txt 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
Makefile 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
READ.ME 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
README.1st 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
READ_ME 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
TechnReport 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
add_run.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
add_run.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
aiso.bdy 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
aiso.spc 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
algollike.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
algollike.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
clang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
compare.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
compare.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
debug.par 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
error.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
error.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
hash.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
hash.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
idf.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
idf.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
javalang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
lang.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
language.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
lex.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
lex.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
lisplang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
m2lang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
miralang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
options.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
options.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
pascallang.l 4b8643f73b * removed some clashes between ifdefs and regular tokens %!s(int64=17) %!d(string=hai) anos
pass1.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
pass1.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
pass2.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
pass2.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
pass3.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
pass3.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
percentages.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
percentages.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
runs.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
runs.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
settings.par 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
sim.1 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
sim.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
sim.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
sim.html 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
sim.txt 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
sortlist.bdy 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
sortlist.spc 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
stream.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
stream.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
sysidf.mk 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
sysidf.msdos 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
sysidf.unix 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
system.par 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
text.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
text.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
textlang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
token.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
token.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
tokenarray.c 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos
tokenarray.h 30e4da99da * source code similarity tester (import of original 2.21 sources available %!s(int64=17) %!d(string=hai) anos

README.1st

This is SIM, Software and text similarity tester, most recent revision
(2.19, 20050220)
by Dick Grune, Vrije Universiteit, Amsterdam, the Netherlands ([email protected]).

SIM tests lexical similarity in texts in C, Java, Pascal, Modula-2, Lisp,
Miranda and natural language. It can be used

- to detect potentially duplicated code fragments in large software projects,
- to detect plagiarism in software and text-based projects, educational and
otherwise.

The program is fast:
the UNIX version on a Sun ULTRA does about 50000 tokens/sec,
the DOS version on a Pentium 166 does about 25000 tokens/sec.

SIM is available for UNIX (in source code) and MSDOS (32-bit executables).

UNIX:
To obtain the files, do:
sh sim_2_21.shar
This unpacks the sources, the Makefile, sim.1 and READ_ME.
For installation notes and other info then see READ_ME.

MSDOS:
To obtain the files, do:
[pk]unzip SIM_2_21.zip
This unpacks the executables, SIM.DOC and READ.ME.
For other info then see READ.ME.

Changes from Release 2.19:
Various changes necessitated by Linux flex being different

Changes from Release 2.16:
Various updates and adjustments in the code and the installation
procedure.

Changes from Release 2.13:
Percentage reporting feature added.

Changes from Release 2.12:
Miranda checker added.

Changes from Release 2.9:
Java checker added.
The C checker 'sim' was renamed to 'sim_c', for uniformity.
Converted the sources to ANSI C.
All versions now report non_ASCI characters in the input.

Changes from Release 2.8:
DOS versions can now compare very large files (>400000 tokens)

Changes from Release 1.21, as posted in comp.sources.unix (1987):
Ported to MSDOS
Significant speed improvements
New options: -e, -S and / , to compare files group-wise
New option: -F , to require function names to match exactly
Lisp version added
Miscellaneous improvements


Dick Grune
Vrije Universiteit
de Boelelaan 1081
1081 HV Amsterdam
the Netherlands
email: [email protected]
ftp://ftp.cs.vu.nl/pub/dick
http://www.cs.vu.nl/~dick