Skip to content

We compare traditional tools with LLM-based tools to evaluate their effectiveness in data analytics education for non-computational professionals and students. Our approach, validated through two case studies, shows that using generative AI as the primary tool significantly improves learning efficiency and project development speed.

License

Notifications You must be signed in to change notification settings

jvalverr/data-analytics-education

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Advanced Large Language Models & Visualization Tools for Data Analytics Learning

Welcome to our project on Advanced Large Language Models & Visualization Tools for Data Analytics Learning! Our mission is to revolutionize the way non-computational professionals and students learn data analytics. We do this by harnessing the power of cutting-edge AI technologies like GPT-4 and the advanced visualization capabilities of tools like LIDA. Our research, supported by extensive case studies and published in specialized conferences and journals, shows how these tools can drastically improve both the speed and quality of data-related disciplines education. Join us on this exciting journey to make data analytics and data science more accessible, efficient, and engaging for everyone!

Project Overview

This project, based on two comprehensive studies, explores the use of advanced Large Language Models (LLMs) and visualization tools to enhance data analytics learning for students and professionals from non-computational backgrounds. The methodologies and outcomes described herein underscore the significant benefits of integrating cutting-edge AI technologies such as based on Generative AI (GenAI) into educational practices to foster a deeper understanding and more efficient execution of data-related projects.

Project Objectives

  • Promote a comprehensive understanding of data-based project pipelines.
  • Enhance programming and other computational thinking-related skills through interactive AI assistance.
  • Enable wider adoption of GenAI tools in educational contexts.
  • Improve the efficiency and effectiveness of data-related project development.

Methodology

The project unfolds in several key stages, as outlined in the case studies:

Participants background

Students and professionals from non-computational backgrounds. Specifically, 88% of participants came from fields such as finance, business, social sciences, and others, while the remaining 12% were from engineering disciplines including sustainable engineering, chemical engineering, biomedical engineering, and industrial engineering.

Case Study Design

  • Traditional Approach: Participants first completed a data analytics project using standard Python packages (e.g., scikit-learn, pandas, seaborn) in Google Colab.
  • ChatGPT Approach: Participants then repeated the project with conventional ChatGPT assistance, using the tool mainly for generating code snippets.
  • LIDA + GPT Approach: Finally, participants completed the project using LIDA integrated with the GPT-4 API, enabling automated data summarization, exploration, and advanced visualizations in response to any prompt originating from the project’s source code itself.

Key Findings

to be updated

Materials

Citation

When referencing this project, please use the following citation formats:

Journal Article:

Valverde-Rebaza, J., González, A., Navarro-Hinojosa, O., & Noguez, J. (2024). Advanced large language models and visualization tools for data analytics learning. Front. Educ. 9:1418006. DOI: 10.3389/feduc.2024.1418006.

@article{valverde:frontiers:24,
  title={Advanced large language models and visualization tools for data analytics learning},
  author={Valverde-Rebaza, J. and González, A. and Navarro-Hinojosa, O. and Noguez, J.},
  journal={Front. Educ.},
  volume={9},
  pages={1418006},
  year={2024},
  doi={10.3389/feduc.2024.1418006}
}

Conference Extended-Abstract:

Valverde-Rebaza, J., González, A., Navarro-Hinojosa, O., & Noguez, J. (2024). Empowering Data Analytics Learning: Leveraging Advanced Large Language Models and Visualization Tools. Proceedings of the 19th World Conference on Continuing Engineering Education, IACEE 2024, pp. 47-49. ISBN: 978-1-7327114-3-3.

@inproceedings{Valverde:iacee:24b, 
 author = {Valverde-Rebaza, J. and González, A. and Navarro-Hinojosa, O. and Noguez, J.},
 title = {{Empowering Data Analytics Learning: Leveraging Advanced Large Language Models and Visualization Tools}},
 booktitle = {Proceedings of The 19th World Conference on Continuing Engineering Education},
 series = {IACEE 2024},
 pages = {47--49},
 isbn = {978-1-7327114-3-3},
 publisher = {IACEE},
 year = {2024}
}

About

We compare traditional tools with LLM-based tools to evaluate their effectiveness in data analytics education for non-computational professionals and students. Our approach, validated through two case studies, shows that using generative AI as the primary tool significantly improves learning efficiency and project development speed.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •