Högskolan i Skövde

Evaluating large language models’ capability to generate algorithmic code using prompt engineering
University of Skövde, School of Informatics.
University of Skövde, School of Informatics.
University of Skövde, School of Informatics.
2024 (English). Independent thesis, Basic level (degree of Bachelor), 20 credits / 30 HE credits. Student thesis
Abstract [en]

The study evaluated the performance of large language models (LLMs) such as Gemini, ChatGPT-4, and GitHub Copilot in generating C++ algorithms for specific tasks using different prompting techniques. The central aim was to assess the effectiveness of these models in creating code solutions that are both functionally correct and complete, using a combination of automated unit tests and human evaluation. Across two main tasks (Social Network and Huffman Encoding), the models showed different levels of success in generating functionally correct code. GitHub Copilot and ChatGPT-4 generally produced more syntactically accurate and functionally appropriate code than Gemini. There was also notable variation in completeness, that is, whether the code met all of the tasks' specified requirements. Some models managed to include all necessary functionalities more consistently than others. Gemini, for instance, excelled in generating complete solutions for the Social Network task but had issues with the Huffman Encoding task, where its output often did not integrate the provided code effectively or correctly.

Place, publisher, year, edition, pages
2024. p. 3, 42, xv
Keywords [en]
LLM, large language model, ChatGPT, Gemini, GitHub Copilot, prompt engineering, algorithm
National Category
Information Systems, Social aspects
Identifiers
URN: urn:nbn:se:his:diva-24285
OAI: oai:DiVA.org:his-24285
DiVA, id: diva2:1883138
Subject / course
Information Technology
Educational program
Computer Science - Specialization in Systems Development
Available from: 2024-07-09. Created: 2024-07-09. Last updated: 2024-07-09. Bibliographically approved.

Open Access in DiVA

fulltext (1324 kB), 229 downloads
File information
File name: FULLTEXT01.pdf
File size: 1324 kB
Checksum (SHA-512): ca05f209d8b06cf402baf01f41bda8bb96b4bab8275dd0db6f413c7bcac04196d06eaf0712dffcc06145ffd0f2ca07642b7f0d50a9c14d0ee13cc7aa1825be5b
Type: fulltext
Mimetype: application/pdf
