Högskolan i Skövde

his.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Leveraging large language models for accurate Cypher query generation: Natural language query to Cypher statements
University of Skövde, School of Informatics.
2024 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

The rise of Large Language Models (LLMs) has transformed various fields, including education, health, natural language processing, code generation, content creation, and more. 

The study seeks to use large language models to generate Cypher Queries based on natural language questions. The main objective of the study is to leverage and evaluate large language models and measure their Cypher Query Generation capabilities. 

The study utilizes GPT-3.5 turbo and Code Llama 2 for cypher generation in datasets collected and annotated across three categories: movies, network management, and companies. The study uses In-Context learning and QLoRA for fine-tuning the large language models. The BLEU and ROUGE evaluations indicate that GPT-3.5 turbo, utilizing the InContext learning method, outperforms the Code Llama 2, a fine-tuned model with QLoRA. 

The main challenges faced in this study are the unavailability of datasets and limited computational resources, such as GPU. 

Place, publisher, year, edition, pages
2024. , p. v, 47
Keywords [en]
Large language models, natural language generation, Cypher query, text generation, deep learning
National Category
Information Systems, Social aspects
Identifiers
URN: urn:nbn:se:his:diva-24158OAI: oai:DiVA.org:his-24158DiVA, id: diva2:1881385
Subject / course
Informationsteknologi
Educational program
Data Science - Master’s Programme
Supervisors
Examiners
Available from: 2024-07-03 Created: 2024-07-03 Last updated: 2024-07-03Bibliographically approved

Open Access in DiVA

fulltext(1482 kB)374 downloads
File information
File name FULLTEXT01.pdfFile size 1482 kBChecksum SHA-512
638356616e01a7d92564d201cd39c4fcb237376549309ac0f0ed75a0d104701fe1a20cf40f81f1176c0dab3d840ba1ee0d58dec1fe4d0fb44898efa744318a9a
Type fulltextMimetype application/pdf

By organisation
School of Informatics
Information Systems, Social aspects

Search outside of DiVA

GoogleGoogle Scholar
Total: 374 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 1010 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf