Research and Application of Code Similarity Based on Submission

  • Yu Lang, Shanghai Maritime University
Article ID: 810
Keywords: Commit, Incremental Analysis, Code Similarity, Code Recommendation

Abstract

As code resources continue to accumulate, detecting similar code is becoming harder, and so are code reuse and duplicate checking. To address this problem, this paper proposes a commit-based (submission-based) approach to code recommendation and duplicate checking. The approach uses differential code cloning and word vector methods to find a candidate set of code that is similar to the incremental text of a commit, then applies feature extraction and clustering to select the most relevant code from that candidate set and identify duplicated code. The selected code is also recommended to programmers together with a relevance score. Experimental results suggest that the method is feasible to some extent.
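
The candidate-search step described above can be illustrated with a minimal sketch. It is not the paper's implementation: plain token-count vectors with cosine similarity stand in for the trained word vectors, the feature extraction and clustering stage is omitted, and every name here (`added_lines`, `vectorize`, `recommend`, the `threshold` value, the sample corpus) is a hypothetical choice made for illustration.

```python
import math
import re
from collections import Counter

# Assumption: a crude lexer that splits code into identifiers, numbers, and symbols.
TOKEN = re.compile(r"[A-Za-z_]\w*|\d+|\S")


def added_lines(diff_text):
    """Return the lines a unified diff adds (the commit's incremental text)."""
    return [
        line[1:]
        for line in diff_text.splitlines()
        if line.startswith("+") and not line.startswith("+++")
    ]


def vectorize(code):
    """Token-count vector; a stand-in for the word vectors named in the abstract."""
    return Counter(TOKEN.findall(code))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def recommend(diff_text, corpus, top_k=3, threshold=0.5):
    """Rank corpus snippets by similarity to the added lines of a commit.

    Returns (name, score) pairs; the score plays the role of a relevance score.
    """
    query = vectorize("\n".join(added_lines(diff_text)))
    scored = [(name, cosine(query, vectorize(src))) for name, src in corpus.items()]
    kept = [(name, score) for name, score in scored if score >= threshold]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)[:top_k]


if __name__ == "__main__":
    diff = "--- a/calc.py\n+++ b/calc.py\n+def add(a, b):\n+    return a + b\n"
    corpus = {
        "math_utils": "def add(a, b):\n    return a + b",
        "io_utils": "with open(path) as f:\n    data = f.read()",
    }
    for name, score in recommend(diff, corpus):
        print(f"{name}: {score:.2f}")
```

In this sketch, only snippets whose similarity to the added lines reaches the threshold are kept, and the returned score is what would be shown to the programmer alongside each recommendation.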

Published
2020-06-02
How to Cite
Lang, Y. (2020). Research and Application of Code Similarity Based on Submission. Journal of Networking and Telecommunications, 2(1), 3-6. https://doi.org/10.18282/jnt.v2i1.810
Section
Original Research Articles