Jonas Maebe 4b8643f73b * removed some clashes between ifdefs and regular tokens 17 years ago
Answers 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
ChangeLog 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
HOWTO-FPC.txt de08321368 + short instructions on how to use it 17 years ago
LICENSE.txt 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
Makefile 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
READ.ME 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
README.1st 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
READ_ME 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
TechnReport 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
add_run.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
add_run.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
aiso.bdy 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
aiso.spc 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
algollike.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
algollike.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
clang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
compare.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
compare.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
debug.par 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
error.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
error.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
hash.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
hash.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
idf.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
idf.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
javalang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
lang.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
language.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
lex.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
lex.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
lisplang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
m2lang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
miralang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
options.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
options.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
pascallang.l 4b8643f73b * removed some clashes between ifdefs and regular tokens 17 years ago
pass1.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
pass1.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
pass2.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
pass2.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
pass3.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
pass3.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
percentages.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
percentages.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
runs.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
runs.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
settings.par 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sim.1 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sim.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sim.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sim.html 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sim.txt 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sortlist.bdy 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sortlist.spc 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
stream.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
stream.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sysidf.mk 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sysidf.msdos 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
sysidf.unix 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
system.par 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
text.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
text.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
textlang.l 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
token.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
token.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
tokenarray.c 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago
tokenarray.h 30e4da99da * source code similarity tester (import of original 2.21 sources available 17 years ago

README.1st

This is SIM, Software and text similarity tester, most recent revision
(2.19, 20050220)
by Dick Grune, Vrije Universiteit, Amsterdam, the Netherlands ([email protected]).

SIM tests lexical similarity in texts in C, Java, Pascal, Modula-2, Lisp,
Miranda and natural language. It can be used

- to detect potentially duplicated code fragments in large software projects,
- to detect plagiarism in software and text-based projects, educational and
otherwise.
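
To make "lexical similarity" concrete, here is a minimal sketch of the general
idea: split two texts into tokens and look for the longest run of tokens they
share. This is an illustration only, not SIM's actual algorithm or code; the
names tokenize and longest_common_run and the token pattern are assumptions
made for this example.

```python
import re


def tokenize(text):
    """Split text into identifiers, integer literals and single punctuation marks.

    The pattern is a simplification chosen for this sketch; a real lexer
    would be language-specific.
    """
    return re.findall(r"[A-Za-z_]\w*|\d+|\S", text)


def longest_common_run(a, b):
    """Return (length, start_in_a, start_in_b) of the longest run of equal tokens.

    Plain dynamic programming over two token lists, keeping only the
    previous row; returns (0, 0, 0) when nothing matches.
    """
    best = (0, 0, 0)
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                # Extend the run that ended at the previous token pair.
                cur[j] = prev[j - 1] + 1
                if cur[j] > best[0]:
                    best = (cur[j], i - cur[j], j - cur[j])
        prev = cur
    return best
```

For example, comparing "int total = a + b; return total;" with
"y = 1; int total = a + b;" gives (7, 0, 4): a shared run of seven tokens
starting at token 0 of the first text and token 4 of the second. Because the
comparison is on tokens, differences in whitespace and layout do not matter.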

The program is fast:
the UNIX version on a Sun ULTRA does about 50000 tokens/sec,
the DOS version on a Pentium 166 does about 25000 tokens/sec.
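
As a rough guide to what these rates mean in practice, the arithmetic below
turns a token count into an estimated running time. It is a back-of-the-envelope
sketch only: the function name estimated_seconds is made up for this example,
and it assumes the quoted rate holds for the whole input.

```python
def estimated_seconds(num_tokens, tokens_per_second):
    """Estimate running time as input size divided by throughput."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


# Rates quoted above, in tokens per second.
UNIX_RATE = 50000
DOS_RATE = 25000
```

At these rates, an input of 400000 tokens would take about 8 seconds on the
UNIX system (estimated_seconds(400000, UNIX_RATE)) and about 16 seconds on the
DOS system.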

SIM is available for UNIX (in source code) and MSDOS (32-bit executables).

UNIX:
To obtain the files, do:
sh sim_2_21.shar
This unpacks the sources, the Makefile, sim.1 and READ_ME.
For installation notes and other information, see READ_ME.

MSDOS:
To obtain the files, do:
[pk]unzip SIM_2_21.zip
This unpacks the executables, SIM.DOC and READ.ME.
For other information, see READ.ME.

Changes from Release 2.19:
Various changes necessitated by differences in Linux flex.

Changes from Release 2.16:
Various updates and adjustments in the code and the installation
procedure.

Changes from Release 2.13:
Percentage reporting feature added.

Changes from Release 2.12:
Miranda checker added.

Changes from Release 2.9:
Java checker added.
The C checker 'sim' was renamed to 'sim_c', for uniformity.
Converted the sources to ANSI C.
All versions now report non-ASCII characters in the input.

Changes from Release 2.8:
DOS versions can now compare very large files (>400000 tokens).

Changes from Release 1.21, as posted in comp.sources.unix (1987):
Ported to MSDOS
Significant speed improvements
New options: -e, -S and /, to compare files group-wise
New option: -F, to require function names to match exactly
Lisp version added
Miscellaneous improvements


Dick Grune
Vrije Universiteit
de Boelelaan 1081
1081 HV Amsterdam
the Netherlands
email: [email protected]
ftp://ftp.cs.vu.nl/pub/dick
http://www.cs.vu.nl/~dick