Large Language Models
Large Language Models
Keywords
Large language models - LLMs - Qualitative research - Software engineering
AI's capability to replace human roles. language models for software engineering: a systematic literature review. arXiv preprint
arXiv:2308.10620
Contrarily, empirical findings (Bano et al. 2023), rooted
in an understanding of LLM capabilities and extant [11] Jalil, S., Suzzana, R., Thomas, D.L., Kevin, M., Wing, L.: Chatgpt and software testing education:
research, debunk the AI doomsday notion, particularly for Promises & perils. In: 2023 IEEE International Conference on Software Testing, Veri cation and
qualitative researchers in software engineering. We project Validation Workshops (ICSTW), 4130–37. IEEE (2023)
a harmonious future where LLMs and human researchers [12] Jiang, D., Xiang R., Bill Y-L.: LLM-blender: Ensembling large language models with pairwise ranking
collaboratively further qualitative research. However, while and generative fusion. (2023) arXiv preprint arXiv:2306.02561.
LLMs, like GPT-4 and ChatGPT, show promise, the [13] Kitchenham, B.: Procedures for performing systematic reviews. Keele UK Keele Univ. 33(2004), 1–
ethical conduct, well-motivated studies, the validity and [14] Kuhail, M.A., Sujith, S.M., Ashraf, K., Jose, B., Syed J.S.: Will I be replaced? Assessing chatgpt's
reliability of research findings, and appropriate effect on software development and programmer perceptions of AI tools. Assessing Chatgpt's
dissemination remains pivotal. Effect on Software Development and Programmer Perceptions of AI Tools.
Considering the broader interaction between humans [15] Navigli, R., Simone, C., and Björn, R.: Biases in large language models: origins, inventory, and
and LLMs, while the latter's adeptness in qualitative data discussion. ACM J. Data Inform. Qual. (2023)
analysis can optimize certain facets of research, it is [16] Nguyen-Duc, A., Beatriz C.-D., Adam, P., Chetan, A., Dron, K., Tomas, H., Usman, R., Jorge, M.,
imperative to note their limitations in capturing the Eduardo, G., Kai-Kristian K.: Generative artificial intelligence for software engineering–a research
intricate nuances inherent to human researchers. This Agenda, (2023) arXiv preprint arXiv:2310.18648.
sentiment is echoed in seminal anthropological and [17] Ozkaya, I.: Application of large language models to software engineering tasks: opportunities. Risks
sociological works that emphasize the human touch in Implicat. IEEE Software. 40, 4–8 (2023)
interpreting and understanding data. Critically, the ethical [18] Polonsky, M.J., Jeffrey D.R.: Should artificial intelligent agents be your co-author? Arguments in
considerations surrounding LLM use, ranging from data favor, informed by ChatGPT. In: 91–96. SAGE Publications Sage UK: London, England (2023)
privacy to intellectual property rights, call for rigorous [19] Rudolph, J., Tan, S., Tan, S.: ChatGPT: bullshit spewer or the end of traditional assessments in
scrutiny. higher education? J. Appl. Learn. Teach. 24, 6 (2023)
[20] Scoccia, G.L.: Exploring Early Adopters' Perceptions of ChatGPT as a Code Generation Tool. In: 2023
[1] Alkaissi, H., McFarlane, S.I.: Artificial hallucinations in ChatGPT: implications in scientific writing. [21] Treude, C., Hideaki H.: She Elicits Requirements and he tests: software engineering gender bias in
Cureus 15, 192 (2023) large language models. (2023) arXiv preprint arXiv:2303.10131.
[2] Arora, C., John, G., Mohamed, A.: Advancing requirements engineering through generative AI: [22] Watkins, R.: Guidance for researchers and peer-reviewers on the ethical use of large language
assessing the role of LLMs. (2023) arXiv preprint arXiv:2310.13976. models (LLMs) in scientific research work ows. AI Ethics 16, 1-6 (2023)
[3] Balel, Y.: The role of artificial intelligence in academic paper writing and its
[4] Bano, M., Didar Z., Jon W.: Exploring qualitative research using LLMs. (2023)
arXiv preprint arXiv: 2306.13298. Bender, E.M., Timnit G., Angelina M.-M.,
[5] Byun, C., Piper, V., Kevin, S.: Dispensing with Humans in Human-Computer Interaction Research. In:
Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems, pp. 1–
26 (2023)
[6] Easterbrook, S., Singer, J., Storey, M.A., Damian, D.: Selecting empirical methods for software
[7] Ebert, C., Louridas, P.: Generative AI for software practitioners. IEEE Softw. 40, 30–38 (2023)
Emmert-Streib, F.: Importance of critical thinking to understand ChatGPT. Europ. J. Human Genet.
[8] Gentles, S.J., Cathy, C., Jenny, P., Ann Mckibbon, K.: Sampling in qualitative research: insights from
[9] Hoda, R.: Socio-technical grounded theory for software engineering. IEEE Transaction Software