Actual Approaches For Sentiment Analysis (II)
SENTENCE-Level Sentiment Classification
The same classification methods explained in the post related to “Document-level sentiment classification” can be applied to individual sentences. But in this case, it’s not assumed each sentence as an opinion. So, in these cases, it’s also necessary a previous task to classify sentences as subjective or objective. The sentences classified as subjective sentences are then classified as positive or negative opinions, what is called sentence-level sentiment classification.
Given a sentence, two sub-tasks are performed:
- Subjectivity classification: Determine whether is a subjective sentence or an objective sentence.
- Sentence-level sentiment classification: If is subjective, determine whether it expresses a positive or negative opinion.
Both problems are classification problems. Thus, traditional supervised learning methods are again applicable.
However, a limitation continues, and it’s the assumption that a sentence expresses a single opinion from a single opinion holder, fact that it’s only appropriate for simple sentences.
Because of this, it’s interesting to split compound sentences in simple sentences using NLP (Natural Language Processing) techniques. However, this is not always an easy task and entail different issues, as anaphora resolution and subject identification, that complicate the sentiment analysis task.
FEATURE-BASED SENTIMENT ANALYSIS
Although classifying opinionated texts at the document level or at the sentence level is useful in many cases, they do not provide the necessary detail needed for some other applications. A positive opinionated document on a particular object does not mean that the author has positive opinions on all aspects or features of the object. Likewise, a negative opinionated document does not mean that the author dislikes everything.
In a typical opinionated text, the author writes both positive and negative aspects of the object, although the general sentiment on the object may be positive or negative. Document-level and sentence-level classification does not provide such information. To obtain such details, we need to go to the object feature level.
For this, we need two key mining tasks:
- Identify object features that have been commented on.
Current research on object feature extraction is mainly carried out in online product reviews. There are two common review formats on the Web.
- Pros, cons and the detailed review: The reviewer is asked to describe Pros and Cons separately and also write a full review.
- Free format: The reviewer can write freely, no separation Pros and Cons.
- Determine whether the opinions on the features are positive, negative or neutral.
To identify the orientation of expressed opinions on an object feature in a sentence the sentence-level and clause-level sentiment classification methods discussed applicable here.
NOTE: Directly expressing positive or negative opinions on an object and its features is only one form of evaluation. Comparing the object with some other similar objects is another. Comparisons are related to but are also quite different from direct opinions. They have not only different semantic meanings, but also different syntactic forms. For example, a typical direct opinion sentence is “The acceleration of this car is amazing”, while a typical comparison sentence is “The acceleration of your car is much better than the acceleration of John’s Porsche”.
A comparative opinion expresses a relation of similarities or differences between two or more objects, and/or object preferences of the opinion holder based on some of the shared features of the objects. A comparative opinion is usually expressed using the comparative or superlative form of an adjective or adverb, although not always.
The treatment of this kind of phrases is different, and many of the available tools don’t evaluate them. In contrast, WebOpinion makes a special treatment of these phrases to extract all the possible opinionated elements of a text, including the comments associated to them.
