06 Jul 2010

Actual Approaches For Sentiment Analysis (II)

9:00H

SENTENCE-Level Sentiment Classification

The same classification methods explained in the post related to “Document-level sentiment classification” can be applied to individual sentences. But in this case, it’s not assumed each sentence as an opinion. So, in these cases, it’s also necessary a previous task to classify sentences as subjective or objective. The sentences classified as subjective sentences are then classified as positive or negative opinions, what is called sentence-level sentiment classification.

Given a sentence, two sub-tasks are performed:

  • Subjectivity classification: Determine whether is a subjective sentence or an objective sentence.
  • Sentence-level sentiment classification: If is subjective, determine whether it expresses a positive or negative opinion.

Both problems are classification problems. Thus, traditional supervised learning methods are again applicable.

However, a limitation continues, and it’s the assumption that a sentence expresses a single opinion from a single opinion holder, fact that it’s only appropriate for simple sentences.

Because of this, it’s interesting to split compound sentences in simple sentences using NLP (Natural Language Processing) techniques. However, this is not always an easy task and entail different issues, as anaphora resolution and subject identification, that complicate the sentiment analysis task.

FEATURE-BASED SENTIMENT ANALYSIS

Although classifying opinionated texts at the document level or at the sentence level is useful in many cases, they do not provide the necessary detail needed for some other applications. A positive opinionated document on a particular object does not mean that the author has positive opinions on all aspects or features of the object. Likewise, a negative opinionated document does not mean that the author dislikes everything.

In a typical opinionated text, the author writes both positive and negative aspects of the object, although the general sentiment on the object may be positive or negative. Document-level and sentence-level classification does not provide such information. To obtain such details, we need to go to the object feature level.

For this, we need two key mining tasks:

  • Identify object features that have been commented on.

Current research on object feature extraction is mainly carried out in online product reviews. There are two common review formats on the Web.

  • Pros, cons and the detailed review: The reviewer is asked to describe Pros and Cons separately and also write a full review.
  • Free format: The reviewer can write freely, no separation Pros and Cons.
  • Determine whether the opinions on the features are positive, negative or neutral.

To identify the orientation of expressed opinions on an object feature in a sentence the sentence-level and clause-level sentiment classification methods discussed applicable here.

NOTE: Directly expressing positive or negative opinions on an object and its features is only one form of evaluation. Comparing the object with some other similar objects is another. Comparisons are related to but are also quite different from direct opinions. They have not only different semantic meanings, but also different syntactic forms. For example, a typical direct opinion sentence is “The acceleration of this car is amazing”, while a typical comparison sentence is “The acceleration of your car is much better than the acceleration of John’s Porsche”.

A comparative opinion expresses a relation of similarities or differences between two or more objects, and/or object preferences of the opinion holder based on some of the shared features of the objects. A comparative opinion is usually expressed using the comparative or superlative form of an adjective or adverb, although not always.

The treatment of this kind of phrases is different, and many of the available tools don’t evaluate them. In contrast, WebOpinion makes a special treatment of these phrases to extract all the possible opinionated elements of a text, including the comments associated to them.

26 Jun 2010

Actual Approaches For Sentiment Analysis (I): Document-Level Sentiment Classification

21:35H

As it was said in the post related to “Web Research and Opinion Retrieval”, sentiment analysis it’s not an easy task. Because of this, different studies and approaches have been used for it.

DOCUMENT-LEVEL SENTIMENT CLASSIFICATION

The first and easiest approach consists of classifying an opinionated document (e.g., a product review) as a positive or negative opinion. This task is commonly known as the document-level sentiment classification because it considers the whole document as the basic information unit.

In the document-level sentiment classification given a set of opinionated documents D, it determines whether each document d ∈ D expresses a positive or negative opinion (or sentiment) on an object.
Existing research and work on sentiment classification assume that an opinionated document d expresses opinions on a single object and the opinions are from a single opinion holder. These assumptions makes that this technique only could be applied in concrete contexts, because it’s difficult to find documents with only opinions on a single object.

Most existing techniques for document-level sentiment classification are based on supervised learning, although there are also some unsupervised methods.
Classification Based on Supervised Learning (document level)Sentiment classification can obviously be formulated as a supervised learning problem with two class labels (positive and negative), assuming that all the documents are known to be opinionated.
Training and testing data used in existing research are mostly product reviews with a reviewer-assigned rating (usually between 1 and 5 stars). Typically, a review with 4-5 stars is considered a positive review (thumbs-up), and a review with 1-2 stars is considered a negative review (thumbs-down).
Existing supervised learning methods can be readily applied to sentiment classification, for example, methods as Naïve Bayes, SVM (Support Vector Machines) or Maximum Entropy have been used with “good” results.
Some basic experiments using this approach, start using unigrams as features in classification with methods as Naïve Bayes or SVM. With the training set a classifier is obtained that using the unigrams that typically appears in positive and negative documents, is able to assign a polarity to a document. With this, looking if the unigrams of a document are closer to the unigrams that typically define positive or negative documents, a polarity will be assigned.
This approach has different limitations, some of them are related to the fact that it cannot take in account negations. This is because, for example, the term “not” will appear in positive and negative opinions, and it won’t be a relevant term to identify polarity. With this, “not good” and “good” will be classified as positive opinions, because “not” won’t be a polar term for the classifier.

read more…

22 Jun 2010

WebOpinion, an excellent tool for Online Reputation Management of Companies, brands, products, people

12:31H

A few years ago, a friend of mine told me that the service company where he was employed was working for a very important savings bank of the country and providing them with reports of its online activities. I commented to him that it was a great idea, identifying what were the most relevant news items and opinions and in an orderly manner translating it onto paper. I asked him what was the methodology used to obtain the information, because I had a vague memory that some other company was already working on a natural language process and webmining. My friend smiled and said that it was a very old methodology; he hired six part-time trainees spending hours and hours on the computer. Then they copied and pasted the results into a powerpoint template. He put his logo and delivered the result to the savings bank.

At that time I realized the potential of the product that was in the Online Reputation Management, taking into account the possibility of streamlining the processes and resources and to improve the manual methodology for obtaining information.

WebOpinion is the clearest example of evolutionary improvement of the ORM (Online Reputation Management). With WebOpinion, Online Reputation Management is already a fact, and possible with a single click, with no effort and completely automatic, to know what customers, suppliers and partners are saying about a Company, Brand, Product or Service and even People. It works by tracing the main sources of online information, tracing Blogs, Social Networks, Web 2.0 and Forums.

The “economics” or WebOpinion benefits are many, beginning with an operational cost savings together with improved accuracy and scope of the search results. Additionally, the tool calculates monthly IReOn (Online Reputation Index) automatically, which allows you to monitor monthly how the company / product / brand / person is in terms of online reputation. It is calculated by an algorithm that combines how well or badly clients speak of the company / brand / product / individual, whether the text is long or short with the importance of who made the comments (if we are talking about someone with authority and visibility in the network or not).

One of the clear advantages of WebOpinion is that it significantly shortens the reaction time from those involved in a crisis or disadvantage. Additionally it identifies which users are most active and what is the dissemination and visibility of their comments.

With WebOpinion, Companies can be more calm and can quickly identify and see the views that people put into the network about their brands, products, people, and even themselves as a whole. Once detected, an internal management system is needed, or some kind of support from the media agencies, an important pillar in managing the corrective action network.

Today, at the present time, at this very moment, people may be talking about you in the network, damaging your prestige and reputation. Or on the contrary people may be talking well about you to other potential customers and recommending the virtues of your products. In the measurement of reputation, the well known phrase “It is not important if they speak badly about you or if they speak well about you, but that they speak about you” is not true today. The Reputation Management must seek to achieve excellence with the maximum number of posts with positive feedback and minimizing the negative comments that users post.

08 Jun 2010

The Business Paradigm In The XXI Century: The Management Of The Reputation. Part II

10:34H

Continuing with the analysis of reputation, how can these negative comments affect the reputation of a company? Humans are strange. We focus more on the negative than positive. When a person gets a positive event, on average he communicates it to 3 people, whereas with a negative event, he communicates on average to 11. Are we crazy or masochist? I don’t know the answer to that question. But the truth is that it would be necessary to manage the chain of negative messages that often can become the nightmare of a company, a brand, a product or a person himself, damaging the reputation with third parties.

For this reason, a company must take care of its reputation as if it were their own subsistence. And beyond the corporate reputation a literal effort is crucial, to get a business leadership, in several fields including corporate social responsibility, quality of its products, responsible advertising, the truth of the communication process, openness of the people representing the Company, employee policies, its valuation in the online channel … it all adds up in the perspective of the client and his clinical eye.

It is therefore necessary but not sufficient to keep customers satisfied, loyal customers and manage their value through personalized and differentiated treatment. Customers cannot be cheated. Clients choose, test, consume, judge and communicate. And they are able to challenge the organizations when they don´t feel fully convinced. Reputation gets convinced and the lack of it “wins without convincing.”

By the year 2010, children join the world with Internet under one arm and the other with a mobile. Web 2.0 has been established in our lives and I’m in love with technology. Social networks, wikis, blogs, forums, applications for mobile perhaps unthinkable a few years ago. All technical machines have been created and launched so that our voice can reach anywhere in the world and anyone can read us, internalize and judge our words. And with that, we have become judges of Companies, and we can rent a special megaphone paying 25 euros per month per 3 Megabites guaranteed and 20 euros per mobile plan. Technology has provided an excellent opportunity as clients to share their life experiences on the network and has given us an excellent opportunity as entrepreneurs to manage that business reputation through specific tools, to help us quickly identify sources of problems.

Talking about tools and taking into account the ease of online opinion on Companies, Brands, Products and People, how can we manage the online communication processes of thousands of consumers and users, who can give their opinion on various sources and express themselves in an informal and in a free way? The answer to this question is called WEBOPINION and it’s the most reliable online reputation management tool that I will be describing in the next post.

31 May 2010

The Business Paradigm In The XXI Century: The Management Of The Reputation

15:52H

By the Spanish Language Royal Academy, Reputation means:

1. the opinion that people in general have about someone or something

2. a place in public esteem or regard : good name

Since the beginning of the year 2000, when we still didn’t have the thread of the crisis over our heads, I explained in my seminars to the marketing and business managers in Spanish companies, the different meanings of ‘reputation’, a word that I consider like the “word of the XXI century”.

In the middle nineties, when Internet was no more than 56k and a Netscape Navigator and the cell phones weighted nearly like the actual Mac Book Pro, the Services Companies started having interest in the achievement of the customer satisfaction and in the improvements of the quality services. It’s a challenge thinking that the gas operator comes to your house and he is going to take careful of your Persian carpet by wearing plastic covers on his boots!!

The challenge is done! The companies focused on the customer satisfaction and they tried to identify which the main satisfaction’s reasons were and above all, the dissatisfaction. The companies tried also to evaluate what kind of impact they had on their “happiness” as customers of this company.  All these developed through quantitative, like factorial analysis and multiple regression and of course, qualitative techniques, like the customers Focus Groups.

So, in 2000, the companies mastered the control of the customer’s satisfaction and they started to work on the building of customer loyalty to keep a loyal customer’s bases; that was increased through the called “Word of the Mouth” – or “mouth to ear” like I prefer to call it. The satisfied customer was loyal, because the programmes about the building of customer loyalty were based on “Gemini effect”, that owns an important emotional, rational and economic component. Normally, the customer could win some curious objects exchanging the accumulated points. A satisfied customer is a loyal customer that communicates his happiness to others potential customers…finally, the clients think that a company has the necessary reputation to be still his supplier.

read more…

24 May 2010

Why Does The Opinion Mining Emerge? Which kind Of Needs Does It Try To Cover?

15:44H

The growing popularity of the opinion sites, such as blogs, social networks and recommended sites, opens new opportunities and challenge in order to get different technologies for understanding and getting out the other’s opinions.

An enormous effort has been developed in opinion mining and the sentiment analysis to get this objective. Both of them concentrate all the efforts in the opinion computational treatment, the sentiment analysis and in the texts. Thanks to the growing interest in all this tools, there has been an authentic explosion in the number of companies that nowadays offer this sentiment analysis services. Just in the USA there are today more than 30 companies that nowadays are working on this area.

In the last years the Web 2.0 explosion had made that a big amount of information is shared through blogs, social networks and collaboration sites.

Information can be sorted in two general groups: facts and opinions.  The facts are expressions and information about companies, events and their properties. In other hand, the opinions are subjective sentiment, sensation and evaluations about companies, events and properties.

Opinions are valuable information for a particular user and for a marketing department in a company. If an online user wants to carry on a purchase it’s useful if the user can check before what the other’s opinion users are, and what idea they have about the brand, the product or service.

It’s also interesting for the company because it’s fundamental to know what the users think about and to reveal which the powers and debilities of your products are.

Few years before it was not so important the investment on the opinion search because the subjective texts nearly didn’t exist. During this period if someone needed to make a decision just asked friends or relatives. If a company wanted this information, it turned to opinion polls or group analysis. However these methods have changed a lot since the growing and development of the websites where the content has improved considerably.

Nowadays, if someone wants to buy a product, he hasn’t only to ask their friends and familiars because a lot of opinions exist about this product in the Web that shows the evaluation of the product made by different users. For a company, instead of carrying-out surveys or to hire people to obtain the customers’ opinions about their products and those of the competition, the content generated in the Web can give it this information.

read more…

19 May 2010

Web Research And Opinion Retrieval

9:55H

Recently the opinions’ research is very useful, and so is it  the web research. We offer a complete system that look for opinions after it has seen the whole web’s contents.

There are two typical ways of doing the research:

  1. We can find public opinions about an object or its characteristics. The object could be a product, person, organization, topic or event.
  2. We can find the opinions about a person or organizations related to an object or characteristic. That kind of research is interesting when the person or the organizations that express the opinions are indicated. In this way, we can know what the president of the United States, Barak Obama, thinks about other politician or about a brand.

However we cannot identify the author of each web opinion. We have to check determinate sources where the opinion’s author is included.

Like the traditional web’s research, the opinions’ research has two principles tasks:

  1. Recuperate important documents/sentences for an user’s research
  2. Arrange the relevant documents/sentences or based on some propriety (author, date, polarization).

Also there are important differences. When we recuperate that information or we research the opinions, we have to do two additional works:

  1. We have to find relevant documents or sentences for the topic that interest us.
  2. We have to determinate if a document or sentence expresses a negative or positive opinion. That work is only for the sentimental analyse and the opinion’s research is more complicated than the traditional research.

Important companies are working hard to grow up and improve the recent techniques to calculate the sentiment and to create a general and robust system to recuperate opinions that could become lead, as is Google in traditional information retrieval systems. The products as WebOpinion try to accomplish this task, using computational algorithms that can revolutionize the present world  of marketing.

WebOpinion

WebOpinion, the tools that alows to manage the Online Reputation, highly specialized in the treatment of the spanish language