Demo image
logo

A Similarity Detection Tool for Jupyter Notebooks

A Tool for Detecting Similarities in Jupyter Notebooks Used as Assessment Reports [Plagiarism Detection]

Learn more [Paper]

Similarity Score

...

Overall similarity score स𝐴←𝐵 indicates fraction of contents in A similar to B.

  • • स ∈ [𝟎, 𝟏]
  • • Asymmetric: स𝐴←𝐵 ≠ स𝐵←𝐴
See Details ...
Overall Score

Adjusting the aggressiveness of similarity detection algorithms by tuning:

  • Threshold 𝑇
  • • minimum block length 𝐿𝑚

By adujsting 𝑇 and 𝐿𝑚, JBEval can be adopted to diverse context, which allows us to use JBEval for assessments in different courses.

Adjusting & Adopting

...

Overall similarity score स𝐴←𝐵 indicates fraction of contents in A similar to B.

See Details ...
line-matching line-matching

Features

JBEval comes with following features:

Supports

Code/Text

JBEval detect similarity not only in code but also in text written in Jupyter Notebooks as part of report.

Currently it supports two programing languages and text contents written in Raw and Markdown cells
  • Code:Python
  • Code:R
  • Text:plain-text (Raw)
  • Text:HTML (Markdown)

Control aggressiveness

Control & Adopt

JBEval can be adopted to divers sets of context, by adjusting two main parameters.

Adjusting Threshold T and minimum block length Lmin
  • Programming heavy reports (Computer Science)
  • Programing + Text (Physics/Chemistry)
  • Light Programing + Text (Humanities)
  • Project reports / Assignments

Validation with visualisation

Visualisations

JBEval produces amazing diagrams and pictures to visualise the similarity between two reports, which helps to validate the plagiarims.

Validating the similarity cases
  • Similarity Maps
  • Visualising entire cohorts report
  • helps to validate the plagiarism cases
line-matching

Processing

Batch Processings

JBEval can be used to process batches of Jupyter-Notebooks, allowing to detect plagiarism with different cohorts

Processing in bulk
  • Fast batch processing
  • Comparing with previous cohorts
  • Detects plagiarism and collusions
  • Identifying groups of similar reports
line-matching

Comming soon ...

We are developing the online services of this tool for public use, which will be released with full details soon. For now, please refer to our publication.

...

Get Started Full Paper