Radio Station

Study Guide: Social Media Analytics & AI - Module 7

Yo, what it is! You know what it is, it’s your man Kingmusa— and welcome to The Study Guide! I'm here to break down today's class notes and help us learn together. Today we are going over Text Analytics - II (Sentiment Analysis) and we will be focusing on Module 7: Text Analytics - II (Sentiment Analysis)"

 

Let's dive into our module on Sentiment Analysis. We're exploring how it's used to understand emotions and opinions in text data.

Key Concept of the Day:

 Today, we're focusing on the basics of sentiment analysis. This includes understanding how it works, why it's important, and the challenges involved in accurately interpreting sentiment.Sentiment analysis, a superpower that helps us understand people’s feelings, finds hidden emotions and makes sense of their words. In this Communalytic tutorial, we’ll explore sentiment analysis and visualization using Super Bowl data. Submit your cleaned data by the end of the week. 


Microsoft Excel has a free sentiment analysis add-in from Azure Machine Learning, making it easy to start. Import text data from various sources, and the tool categorizes sentiments into positive, negative, and neutral. We’ll install the Azure Machine Learning add-in and explain the scoring system (0-1 scale). A higher negative score doesn’t necessarily indicate a more negative sentiment; context matters. The sentiment analysis tool is available with Office 365, accessible anytime, anywhere. The data cleaning example handles errors in column headers, ensuring clean data. The tool analyzes text data, providing a general understanding of sentiment. We’re not just talking about sentiment analysis; we’re also exploring ethical considerations of ChatGPT in higher education. Our research focused on public sentiment and ethical considerations using thematic analysis and sentiment analysis on a dataset of tweets. Key findings include authenticity, integrity, creativity, productivity, and research. 


The overall sentiment towards ChatGPT was positive! Public perception of AI integration in education is generally open, but concerns about academic integrity and ethics persist. Ethical frameworks are crucial for responsible AI incorporation. Future research should explore linguistic and platform scope to capture diverse global perspectives. Understanding public sentiment and engagement with ChatGPT is vital for informed AI use in higher education. Let’s delve into sentiment analysis and explore ethical considerations of ChatGPT in higher education. We eagerly share our findings!


Sentiment analysis helps us capture public opinion, understand customer feedback, and track social trends. It's a valuable tool for businesses, politicians, and researchers.


Here are the main points:

  1. Sentiment analysis determines the emotional tone in text.
  2. It assigns scores to text to categorize it as positive, negative, or neutral.
  3. Context, sarcasm, and ambiguity can make sentiment analysis challenging.
  4. Tools like Communalytic and Excel can be used for sentiment analysis.

Social media analysis is crucial for understanding public opinion, particularly regarding ChatGPT’s impact on education. However, research gaps exist due to limited studies on ChatGPT’s social media interactions. This research aims to understand ChatGPT’s pros, cons, and societal implications in higher education. Generative AI, utilizing deep learning models like GANs and VAEs, creates realistic content across various fields. While it offers benefits like content creation and personalized medicine, ethical concerns about AI-generated authenticity persist, exemplified by ChatGPT’s human-like writing capabilities. Generative AI is reshaping healthcare and finance, automating tasks and enhancing customer service. Responsible and ethical AI usage requires strong ethical guidelines and rethinking copyright laws. 


In education, personalized learning based on student data and interactive AI tutoring can revolutionize the learning experience. Virtual labs offer convenient and accessible experimentation alternatives. Social media has transformed communication and interaction, serving as a platform for information sharing and crisis relief. Platform X fosters discourse on diverse topics. It’s like a virtual hug for emotional support. Let’s discuss how people feel about AI in education. Some are excited, while others are worried. This study analyzes tweets about ChatGPT’s use in schools. We aim to understand the main topics and feelings expressed in these tweets. We’ll use methods like analyzing tweets and counting positive and negative feelings. We’ll start with a large dataset of 18,763 tweets from the X API, cleaning it to remove duplicates, weird stuff, and nonsensical tweets. We’ll keep tweets posted between December 1, 2022, and September 2023 to focus on real people. After cleaning, we’ll use Leximancer to understand the tweets and Voyant Tools to visualize the data.


Leximancer analyzes large text collections, while Voyant Tools is an open-source web-based tool for text analysis, used by catalogers, digital humanitarians, librarians, transcribers, and archivists. SentiStrength categorizes text into positive, negative, and neutral feelings using an algorithm, is easy to use, works with different languages, and is free. It analyzes informal language using a list of words with scores from -5 to 5. Analyzing ChatGPT’s sentiment aims to understand its response strength and strengths/weaknesses. SentiStrength’s accuracy is comparable to advanced machine learning models but relies on a fixed list, limiting adaptability. Automated analysis and manual checking identified five themes: Authenticity, Integrity, Creativity, Productivity, and Research. Authenticity involves creativity, originality, truthfulness, and accuracy. Integrity involves following ethical rules, reliability, honesty, and respecting user privacy. Creativity involves generating new ideas. Productivity involves ease, efficiency, and time management. Authenticity is the most important theme. Ethical concerns include following rules, reliability, honesty, and privacy. Data privacy, security, and inaccurate information risks are ethical concerns. Creativity is demonstrated by original ideas. Productivity is improved by ease, efficiency, and time management. Sentiment analysis results: 46.6% positive, 38.5% neutral, and 14.8% negative. Negative sentiment stems from concerns about accuracy and reliability. Students may rely on inaccurate information, which could be risky. 


We analyzed the data using Voyant-tools to find important terms and patterns. Ethical concerns include data privacy, security, potential risks, bias in AI-generated content, reduced critical thinking, over-reliance on AI-generated answers, and impact on learning. Technical challenges include integrating ChatGPT with existing platforms and handling complex queries. Public concerns include AI compromising academic honesty and ethical behavior, and the impact on creativity. ChatGPT could innovate learning but prompt a reevaluation of the creative process and AI’s role. Benefits include enhanced efficiency and optimized task management in education. Public sentiment is mixed, with optimism about potential benefits but concerns about misuse, bias, and the impact on human-led academic effort. Ethical implications include data privacy concerns due to sensitive information collection and potential bias in AI-generated content. Policy and practice implications include clear rules for ethical AI use, especially regarding data privacy, avoiding biases, and responsible development. AI literacy in education requires teachers learning about AI tools, their strengths and weaknesses, biases, and data privacy issues. Transparency and trust are crucial, especially in building trust between students and faculty. Clear processes are key. Continuous monitoring and evaluation of AI tools are crucial to ensure ethical use. 


Many believe AI can personalize learning and boost productivity, but stronger ethical guidelines and responsible integration are also needed. Educators, policymakers, and tech developers must collaborate to improve learning while preserving academic values. The study focused on English-language tweets, potentially limiting its global applicability. The biased dataset, favoring X users, may also affect its representativeness. Exploring other social media platforms could provide a more comprehensive understanding of AI in education.


The College of Liberal and Applied Arts at Stephen F. Austin State University supports research funding. GAE curates data, writes the original draft, manages the project, and secures funding. HKA reviews and edits the manuscript, creates visualizations, and supervises the project. GAE performs data curation. AI in Creative Industries examines AI’s use in the creative industries. Generative AI in Education reviews its use in educational settings. ChatGPT in Medical Education explores its potential impact and opportunities in medical education. AI in Art (Cetinic and She, 2022) explores how people use AI to create art. Academic Integrity with ChatGPT (Cotton, Cotton, and Shipway, 2023) discusses maintaining academic integrity in the age of ChatGPT. Ethical ChatGPT in Education (Crawford, Cowling, and Allen, 2023) emphasizes ethical leadership when using ChatGPT for character development, assessment, and learning. ChatGPT in Education discusses its impact on industry and higher education, focusing on transdisciplinarity and digital humanities. AI-Powered Learning explores how AI-based learning content generation and pathway augmentation enhance learner engagement. ChatGPT Integration in Education investigates how ChatGPT can be integrated into educational settings. This study explores ChatGPT’s applications and implications in various fields, particularly in medical education. It analyzes Indonesian tweets about the COVID-19 pandemic using Sentistrength, examines GANs in creative industries for human-AI collaboration, and investigates audience engagement with ChatGPT-related content on YouTube. 


Leximancer, a specialized software for text analysis, is also mentioned. In education, studies by Li et al. (2023) identified concerns about ChatGPT’s adoption, Lund et al. (2023) discussed ethical considerations, Nguyen (2023) examined data risks in news media covering big data and AI, Öztürk and Ayvaz (2018) analyzed Twitter data on the Syrian refugee crisis, and Rudolph, Tan, and Tan (2023) discussed generative AI’s potential in education. The study also examines ChatGPT’s use in generating quantitative research data in tourism. A case study approach was employed to explore ChatGPT’s implications for data generation in quantitative research. NVivo and Leximancer are compared for qualitative data analysis, and consumer engagement through interactive AI and mixed reality experiences are explored. In education, Wu and Yu (2023) conducted a meta-analysis on AI chatbots’ impact on student learning outcomes, Wang et al. (2023) investigated blockchain technology for risk prediction and credibility assessment in online public opinion, and Vilares et al. (2015) developed a Spanish sentiment analysis tool for analyzing political tweets in real time. Social media data, a valuable resource comparable to oil, constitutes a significant portion of “big data.” It reveals that many Americans use platforms like Facebook and Twitter, providing ample data. This module teaches students how to analyze social media data, including content and sentiment analysis. Twitter, the fourth most visited website in the US and eighth globally, attracts over 542 million monthly visitors as of August 2017. Despite its importance in marketing and communications, companies face challenges in crafting effective campaigns. Content analysis, a systematic and quantitative approach, converts qualitative data into quantitative data by analyzing communicative messages. It identifies and classifies meaningful content, revealing implied meanings and patterns. 


Content analysis is used to analyze various forms of communication, including spoken, written, audio-visual, and online content, to understand human behavior, beliefs, and reactions. Originating in the 17th century with systematic newspaper analysis, content analysis evolved into a scientific method in the 1940s for mass media analysis. It involves finding, understanding, and showing important things in data, making complex information easier to grasp and helping to spot patterns, connections, and people’s thoughts. Content analysis has various applications, such as Nescafé Dolce Gusto’s use to understand coffee lovers and increase Facebook likes. Twitter data analysis provides insights into tweet replies and most frequently used words. In social media, content analysis examines language, speech acts, arguments, word associations, and behavior. For instance, a study by Cruz and Lee (2014) used Twitter data to identify challenges companies face in running campaigns on the platform. They studied how brands are perceived and how people feel about them.


In a nutshell, this module introduces the core concepts of sentiment analysis, providing a foundation for understanding how to analyze and interpret emotions and opinions in textual data. Content Analysis Process: This involves collecting, examining, coding, and counting social media content to gain insights. Two main approaches are inductive (deriving categories from data) and deductive (validating or extending theories). Coding organizes text data into categories by labeling words and phrases, capturing themes and relationships. The purpose is to identify recurring themes for analysis and summarization. Steps include selecting a coding approach, reading the data, coding line by line, categorizing codes, and identifying recurring themes. Hierarchical frames organize codes based on relationships. 


Coding tips include creating specific codes and applying generic codes. Avoid overly similar codes and capture both positive and negative sentiments. Group responses based on themes. Sentiment analysis analyzes content to identify expressed sentiment, commonly used for public opinion insights. It’s a complex NLP field analyzing emotions and opinions in online content, especially social media posts. Categories are primarily positive, negative, and neutral. Contextual nuances, ambiguity, and sarcasm can complicate accurate sentiment analysis. Human perception is more accurate than tools, with good tools achieving around 70% accuracy. Accurate sentiment analysis considers context and meaning of words and phrases, not just individual keywords. Tools struggle with sarcasm, social semantics, and understanding expressions. Understanding audience sentiment is crucial for gauging customer preferences and desires towards a brand. Customer Service Improvement monitors sentiment to identify issues and resolve complaints promptly. Brand and Product Development analyzes customer sentiment to improve messages and products. Hate Speech Detection identifies racist or sexist tweets. 


Data Preprocessing cleans up text for analysis and machine learning. Sentiment Analysis in Tweets determines tweet sentiment based on keywords and phrases. Sentiment Analysis Definition is part of NLP that processes human language to understand feelings. Sentiment Analysis Tools like SentiStrength use word lists to analyze social media text. Social Listening Tools use automated sentiment analysis as a feature.


That wraps up today’s episode of The Study Guide. Remember, we teach to learn, and I hope this has helped you understand Module 7: Text Analytics - II (Sentiment Analysis) better. Keep studying, keep learning, and keep pushing toward your academic goals. Don’t forget to follow me on all platforms @Kingmusa428 and check out more episodes at kingmusa428.com. See y’all next time!"


Post a Comment

0 Comments