![]() summarized the preprocessing techniques and performed experiments to prove they improve significantly the accuracy of classifiers. Those will clean text, normalize text and only keep useful information, Symeonidis et al. Especially, reviews in e-commerce system, blogs and social media are informal, so they contain so much noisy information, unnecessary in detecting the sentiment. Preprocessing techniques are frequently used in natural language processing to prepare text that is going to be classified. Furthermore, we have investigated to enhance more training data automatically in order to improve the performance of the models with the limited training data. Our study is based on semi-supervised learning and used many preprocessing techniques to normalize the data such as emoji icons replacement, elongated characters removal, negation handling, intensification handling. This motivates us to develop an effective solution to text classification in general and sentiment analysis in particular. Especially, the Vietnamese training data are not abundant and lack so much preventing many propositions in research team. This depends on the size and quality of the pre-labeled datasets which are scarce and unavailable for a certain application, they are tedious to collect, expensive and time-consuming to build, depend on domain adaptation and ineffectively handle unseen data. The traditional approach is usually supervised learning, supervised classifiers are used such as Naive Bayes, SVM, logistic regression, ensemble of voting classifiers, also investigating on feature selection for retaining useful features and ignoring redundant features to improve the performing approach. made a survey summarized sentiment analysis methods, including in text, it showed many previous researches in supervised and unsupervised learning. Actually, sentiment analysis is text classification problem which can apply machine learning classifiers in emotional polarities, Soleymani et al. Lexicon-based approach replies on the emotional lexicons to detect customers’ emotions, its main drawbacks are to depend on the context and languages. , there are three main approaches for sentiment analysis: lexicon-based approach, machine learning approach, hybrid approach. Another challenge is to select the most relevant techniques or approaches to classify the sentiment polarities.Īccording to Medhat et al. These have made it hard to analyze text structure, especially detecting negation text being big challenge impacting sentiment detection and text structure evaluation. The most difficult is working with unstructured sentiment, the writer is not required to comply with any constraints: using slang terms, wrong spelling, wrong grammar structure, etc. The first one is text structure, Hussein made a survey on sentiment analysis challenges by comparing many past studies the authors showed types of text structure for sentiment analysis: (i) structured sentiments are format sentiment text (ii) unstructured sentiments are informal and free text (iii) semi-structured sentiments are between format structured text and unstructured text. Working with sentiment analysis faces many challenges. Based on automatic prediction, the traders can make decision easier, and also plan the direction to develop their business. Its main task groups the document into various polarities. Sentiment analysis is an essential task to detect the sentiment polarities in the text applied widely in e-commerce system, blogs, social media. A perfect solution for this problem is sentiment analysis which promises commercial benefits. ![]() This is costs both human resources and money a lot, reaches the customers’ expectation is slow and easy to miss ones. However, this cannot be perform manually, it imagines that many employees follow the customers’ replies about a product, read and analyze the hundreds or thousands of the replies to evaluate the degree of customers’ satisfaction for making next strategies about the products or taking a direction in development. These help to understand the customers’ expectation, evaluate the advantages and disadvantages of the products, also predict the product trends to satisfy the customers’ expectation quickly as possible as. The customers can reply their reviews rapidly, the providers can receive an abundance of the customers’ reviews. An enormous and rapid growth of the Web technologies has changed the way to buy and reply the reviews of the bought products.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |