<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>

<channel>
	<title>Sylvain Paillard</title>
	<atom:link href="http://www.sylvainpaillard.com/wordpress/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://www.sylvainpaillard.com/wordpress</link>
	<description>www.sylvainpaillard.com</description>
	<pubDate>Sun, 28 Mar 2010 04:46:47 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>VLOG - Auto-Tagging, the quest for Compounds!</title>
		<link>http://www.sylvainpaillard.com/wordpress/?p=474</link>
		<comments>http://www.sylvainpaillard.com/wordpress/?p=474#comments</comments>
		<pubDate>Sat, 27 Mar 2010 20:42:30 +0000</pubDate>
		<dc:creator>Sylvain Paillard</dc:creator>
		
		<category><![CDATA[artificial intelligence]]></category>

		<category><![CDATA[customer experience]]></category>

		<category><![CDATA[Auto-tagging]]></category>

		<category><![CDATA[Compound]]></category>

		<category><![CDATA[Tag cloud]]></category>

		<category><![CDATA[Term Extraction]]></category>

		<guid isPermaLink="false">http://www.sylvainpaillard.com/wordpress/?p=474</guid>
		<description><![CDATA[In this video, I try to explain the concept of Auto-tagging, which can be considered as the task of extracting terms automatically out of unstructured textual data
Terminology mining, term extraction, term recognition, or glossary extraction, is a subtask of information extraction. The goal of terminology extraction is to automatically extract relevant terms from a given [...]]]></description>
			<content:encoded><![CDATA[<p><em>In this video, I try to explain the concept of Auto-tagging, which can be considered as the task of extracting terms automatically out of unstructured textual data</em></p>
<blockquote><p>Terminology mining, term extraction, term recognition, or glossary extraction, is a subtask of information extraction. The goal of terminology extraction is to automatically extract relevant terms from a given corpus.</p>
<p>In the semantic web era, a growing number of communities and networked enterprises started to access and interoperate through the internet. Modeling these communities and their information needs is important for several web applications, like topic-driven web crawlers, web services, recommender systems, etc. The development of terminology extraction is essential to the language industry.</p>
<p><i>From Wikipedia - Terminology extraction</i></p></blockquote>
<p>In this log, I speak about Auto-tagging:</p>
<ul>
<li>Why we need it</li>
<li>What is the challenge</li>
<li>The different approaches: naive term frequency versus global semantic web-service</li>
<li>The key role of compounds (i.e.: double-worded tags)</li>
</ul>
<div class="flvPlayer">				<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,0,0" width="640" height="500"><param name="movie" value="http://www.sylvainpaillard.com/wordpress/videos/flvplayer.swf?file=http://www.sylvainpaillard.com/wordpress/videos/2010-03-27 - auto-tagging.flv&amp;autoStart=false;" /><param name="quality" value="high" /><param name="wmode" value="transparent" /><embed src="http://www.sylvainpaillard.com/wordpress/videos/flvplayer.swf?file=http://www.sylvainpaillard.com/wordpress/videos/2010-03-27 - auto-tagging.flv&amp;autoStart=false;" quality="high" wmode="transparent" width="640" height="500" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" /><br />
				</object></div>

]]></content:encoded>
			<wfw:commentRss>http://www.sylvainpaillard.com/wordpress/?feed=rss2&amp;p=474</wfw:commentRss>
		</item>
		<item>
		<title>VLOG - Search &#038; Tag Clouds</title>
		<link>http://www.sylvainpaillard.com/wordpress/?p=449</link>
		<comments>http://www.sylvainpaillard.com/wordpress/?p=449#comments</comments>
		<pubDate>Wed, 17 Mar 2010 21:02:55 +0000</pubDate>
		<dc:creator>Sylvain Paillard</dc:creator>
		
		<category><![CDATA[customer experience]]></category>

		<category><![CDATA[Auto-tagging]]></category>

		<category><![CDATA[Search & Navigation]]></category>

		<category><![CDATA[Search cloud]]></category>

		<category><![CDATA[Tag cloud]]></category>

		<guid isPermaLink="false">http://www.sylvainpaillard.com/wordpress/?p=449</guid>
		<description><![CDATA[This is my first video log. I will try to address topics more openly and frequently this way.
In this log, I speak about Search and Tag Clouds:

Difference between Search and Tag Clouds
Similarity between Search Clouds and Search Autocomplete Suggestions
Potential to extend their usage for navigation
Challenge to mix search and tags Clouds together (usage versus document [...]]]></description>
			<content:encoded><![CDATA[<p><em>This is my first video log. I will try to address topics more openly and frequently this way.</em></p>
<p>In this log, I speak about Search and Tag Clouds:</p>
<ul>
<li>Difference between Search and Tag Clouds</li>
<li>Similarity between Search Clouds and Search Autocomplete Suggestions</li>
<li>Potential to extend their usage for navigation</li>
<li>Challenge to mix search and tags Clouds together (usage versus document frequency)</li>
</ul>
<div class="flvPlayer">				<object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,0,0" width="640" height="500"><param name="movie" value="http://www.sylvainpaillard.com/wordpress/videos/flvplayer.swf?file=http://www.sylvainpaillard.com/wordpress/videos/2010-03-17 - tag-search-cloud.flv&amp;autoStart=false;" /><param name="quality" value="high" /><param name="wmode" value="transparent" /><embed src="http://www.sylvainpaillard.com/wordpress/videos/flvplayer.swf?file=http://www.sylvainpaillard.com/wordpress/videos/2010-03-17 - tag-search-cloud.flv&amp;autoStart=false;" quality="high" wmode="transparent" width="640" height="500" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" /><br />
				</object></div>

]]></content:encoded>
			<wfw:commentRss>http://www.sylvainpaillard.com/wordpress/?feed=rss2&amp;p=449</wfw:commentRss>
		</item>
		<item>
		<title>Tag clouds - what is at stake?</title>
		<link>http://www.sylvainpaillard.com/wordpress/?p=409</link>
		<comments>http://www.sylvainpaillard.com/wordpress/?p=409#comments</comments>
		<pubDate>Thu, 11 Mar 2010 05:35:12 +0000</pubDate>
		<dc:creator>Sylvain Paillard</dc:creator>
		
		<category><![CDATA[artificial intelligence]]></category>

		<category><![CDATA[customer experience]]></category>

		<category><![CDATA[Auto-tagging]]></category>

		<category><![CDATA[Context]]></category>

		<category><![CDATA[Search & Navigation]]></category>

		<category><![CDATA[Semantic Web]]></category>

		<category><![CDATA[Tag cloud]]></category>

		<guid isPermaLink="false">http://www.sylvainpaillard.com/wordpress/?p=409</guid>
		<description><![CDATA[In echo to the ever-increasing popularity of the tag-clouds, the emerging domain of auto-tagging aims to be the solution to populate these attractive visual components without requiring to tag each page individually. While many approaches try to solve this challenge, most of them do not address the real underlying technical challenges. But in order to [...]]]></description>
			<content:encoded><![CDATA[<p>In echo to the ever-increasing popularity of the tag-clouds, the emerging domain of auto-tagging aims to be the solution to populate these attractive visual components without requiring to tag each page individually. While many approaches try to solve this challenge, most of them do not address the real underlying technical challenges. But in order to evaluate how tag-clouds can deliver their full potential, we have to analyze what is at stake for the end-users and understand why tag-cloud could make a real difference in the way to access information.</p>
<p>In this new domain where art can meet with technology, visualization with data mining and repetitive manual efforts with automation, I felt it was interesting to inspect the different components and their roles; and to figure out what is new and what is old, what is solved and what is not, what is possible and what is pure fantasy&#8230;</p>
<p><img class="aligncenter size-full wp-image-410" title="wordle_tag_cloud" src="http://www.sylvainpaillard.com/wordpress/wp-content/uploads/2010/01/wordle_tag_cloud.jpg" alt="wordle_tag_cloud" width="600" height="375" /></p>
<p><span id="more-409"></span>Let&#8217;s first visit the different aspects involved in the generation of a beautiful tag cloud like the one in the picture here-above:</p>
<p><strong>The visual generator:</strong></p>
<p>The visual generator traditionally receives a list of weighted terms and display them visually. While most tag clouds order the list from the top-left to the bottom right according to a specified order (the most common case being the alphabetical order), others take a more artistic approach like the one shown above.</p>
<p>The fundamental role of all visual generators is pretty much always the same. Therefore, I will focus more on the usefulness of the generated tag-cloud for the user as a tool to access information. I will analyze it in different contexts, and in relation with the way the terms and their weights are generated.</p>
<p><strong>The content (i.e.: the weighted terms):</strong></p>
<p>The traditional approach of a tag cloud consist in analyzing all the tags of all the content of a web site (or of a content database) and to generate all the weights with a scoring formula as this one (from Wikipedia):</p>
<p><img class="tex" src="http://upload.wikimedia.org/math/a/c/3/ac38b08be1516a4a385d43cca5daef8a.png" alt="s_i = left lceil frac{f_{mathrm{max}}cdot(t_i - t_{mathrm{min}})}{t_{mathrm{max}}-t_{mathrm{min}}} right rceil" /> for <span class="texhtml"><em>t</em><sub><em>i</em></sub> &gt; <em>t</em><sub>min</sub></span>; else <span class="texhtml"><em>s</em><sub><em>i</em></sub> = 1</span></p>
<ul>
<li><span class="texhtml"><span style="font-family: Times New Roman;"><em>s</em><sub><em>i</em></sub></span></span>: display fontsize</li>
<li><span class="texhtml"><span style="font-family: Times New Roman;"><em>f</em><sub>max</sub></span></span>: max. fontsize</li>
<li><span class="texhtml"><span style="font-family: Times New Roman;"><span class="texhtml"><span style="font-family: Times New Roman;"><em>t</em><sub>i</sub></span></span></span></span>: count</li>
<li><span class="texhtml"><span style="font-family: Times New Roman;"><em>t</em><sub>min</sub></span></span>: min. count</li>
<li><span class="texhtml"><span style="font-family: Times New Roman;"><em>t</em><sub>max</sub></span></span>: max. count</li>
</ul>
<p>Then, all the tags are ordered according to their score and only the N top tags are considered to be displayed. Other variants are using a logarithm approach to calculate the score (instead of a linear approach like in this example). However, we can see that the term frequency (<span class="texhtml"><span style="font-family: Times New Roman;">ti</span></span>: count) is the key element of the scoring.</p>
<p><em>NB: In opposition, a Search-Cloud considers the frequency of the search to calculate the score, and not the number of results the search returns. Therefore, the visual generator of a Search-Cloud and of a Tag-Cloud is the same, but the content and its meaning is fundamentally different.</em></p>
<p><strong>Why the tag-cloud?</strong></p>
<p>Let&#8217;s analyze the  basic advantages of the tag-cloud as a visualization tool: while more structured ontology (e.g.: a tree of categories) have an inherent structure which is easy to apprehend for the end-user, a tag has a poor level of structure as it simply identifies a concept, but does not relate it with other concepts.</p>
<p>As a result, tags are a list of concepts, some of which are shared only by a very small number of elements, while other might cover the large majority of them. Therefore, the most frequent can be a quite good representation of the domain specific terminology of the web site. The goal of the tag-cloud is to show their relative importance as a new dimension in their visual presentation.</p>
<p>Such &#8220;horizontal view&#8221; gives a complete outlook of the most frequently referred concepts in a way which is easy to grasp visually. Where structured trees of category might fail to give an efficient access to many elements in the sub-nodes (either because the meaning of the first nodes is not intuitive, or because the user would not have thought the concept would be available at all), tag-clouds can succeed to give a solution, intuitive for anyone.</p>
<p><strong>In what contexts?</strong></p>
<p>While the traditional use of the tag-cloud is in a global context, it could also be used within a more specific context to further drill-down an initial category selection or a textual search (e.g.: after clicking on &#8220;student&#8221; in the tag cloud, a potentially very large list of result can be displayed, the tag cloud could then be refreshed with the tags and their score within this search context). This approach seems to make a lot of sense, but brings a certain amount of behavior which should be analyzed.</p>
<ol>
<li><strong>Synonyms</strong>: when two tags are synonyms, they are usually not likely to have the same association to the document (this is a general problem of tags, but it is emphasized by the usage of contextual tag clouds). If &#8220;automobile&#8221; and &#8220;car&#8221; are both tags in a web site, then even if they will have many documents in common, there will be a certain amount (maybe a minority) which will be exclusive for each. As a result, after selecting &#8220;automobile&#8221;, the most likely best scored tag will be &#8220;car&#8221;. This is a bad experience for two reasons: first it gives the impression to the user the coverage of the first tag is not complete (which is true, but not necessarily the feeling we should give to the user) and it pushes with the biggest score a repetition of the same concept which is likely to reduce only the list of result of a small fraction (if &#8220;automobile&#8221; provides 1000 results, then adding &#8221;car&#8221; might only reduce the list of results to 950 results). The experience is therefore bad from all perspective and it is clearly a case which should be avoided (one solution can be to consider the absolute distance to the mean 50% of the remaining documents, this way even if the synonyms are not removed, they appear with a much smaller size than the concepts which are very discriminant).</li>
<li><strong>Small list of results:</strong> while the list of results remains big enough, the statistical distribution of the tags is likely to stay meaningful, but when this number decreases (which is expected by the user to happen as quickly as possible), the fundamental advantages of the tag cloud is also reduced significantly. On the one hand, the results are likely to belong more an more to a very specific concept and the tags differentiating them are likely to become more and more unimportant or even irrelevant. If the user starts to feel the value of the tag-cloud decreases, it is likely that he will put the overall usefulness of the tag cloud in question. An easy solution is to remove the cloud when a minimum threshold is reached, but it might also disorient the user as he might not understand why it happens. The solution for this matter might be the subject for a future post in this blog as I didn&#8217;t find a valid solution for it so far.</li>
</ol>
<p><em>NB: Another interesting idea could be to consider te popularity and the tag (the number of times it is clicked on or entered as a search query) and not only the number of documents linked to the tag to define the scoring.</em></p>
<p><strong>The right terms are compounds:</strong></p>
<p>While the tag cloud picture here-above is visually attractive, it is not containing what I would consider to be the &#8220;right terms&#8221;. Terms like &#8220;use&#8221;, &#8220;however&#8221;, &#8220;much&#8221; are clearly irrelevant and do not bring any navigation value to the user (on the contrary, they can be quite misleading).</p>
<p>It turns out that the right terms are compounds and their sub-parts:</p>
<blockquote><p>The majority of domain specific terms are compound nouns, in other words, uninterrupted collocations. 85% of domain specific terms are said to be compound nouns. They include single-nouns of the remaining 15% very frequently as their components, where &#8220;single-noun&#8221; means a noun which could not be further divided into several shorter and more basic nouns. In other words, the majority of compound nouns consists of the much smaller number of the remaining 15% single-noun terms and other single-nouns. In this situation, it is natural to pay attention to the relation among single-nouns and compound nouns, especially how single-noun terms contributete to make up compound noun terms.</p>
<p><em>A simple but powerful automatic term extraction method - Hiroshi Nakagaway (University of Tokyo) and Tatsunori Mori (Yokohama National University)</em></p></blockquote>
<p>It is quite easy to understand why this makes sense. Let&#8217;s consider the following terms : &#8220;remote&#8221;, &#8220;power&#8221;, &#8220;control&#8221; and &#8220;off&#8221; and let&#8217;s imagine them distributed with many other terms in a tag cloud. Now let&#8217;s imagine compounds instead: &#8220;remote control&#8221; and &#8220;power off&#8221;. It is easy to see how much domain specific knowledge is contained in compounds, but are lost if they are split into single terms.</p>
<p><strong>Auto-tagging versus manual tagging:</strong></p>
<p>While I have tagged manually my blog to this day and do not complain about doing it, a significant increase in the amount of data might very well change my mind on the matter. Would many people write posts in my blog without being necessarily careful in tagging, I would then have an increasing feeling of uncontrollable chaos. An auto-tagging technique able to remove this trouble and avoid the synonym problems stated above, would then clearly be of great value.</p>
<p>I will not go into the details of Auto-tagging technologies in this post, as I believe it should be a debate on its own, but would simply like to state the key challenges at stake in Auto-tagging:</p>
<ul>
<li>The overall set of tags used should not be domain specific only, they should also differentiate contents between each other (e.g.: tags like &#8220;computer science&#8221;, &#8220;software&#8221; or &#8220;Internet&#8221; would have no value for my blog, even if they are domain specific, as they do not help identify a specific post (too general)). However, the statistical occurrences of &#8220;software&#8221; might be (by accident) misleading for a naive auto-tagging technology, to the point of making it look like a very good tag (maybe 50% of my posts use this word and 50% don&#8217;t)</li>
<li>A content can be linked to a tag even if there is no no occurrences of it in its textual content (e.g.: very often, we use words which are very important in a summary, providing the key topics involved, while these words do not appear in the content at all)</li>
</ul>
<p>Finding the right list of tags and connecting them to the right content should be considered as two separate processes. The first part of the problem should be to find the tags throughout all the content (possibly relying on external global semantic knowledge to do so) and the second to extend their mappings to the document (even if it is possible, in rare occasion, that a document, which has been the source of a selected tag, should not be connected to it).</p>

]]></content:encoded>
			<wfw:commentRss>http://www.sylvainpaillard.com/wordpress/?feed=rss2&amp;p=409</wfw:commentRss>
		</item>
		<item>
		<title>10 typical Search &#038; Navigation Dead-Ends</title>
		<link>http://www.sylvainpaillard.com/wordpress/?p=355</link>
		<comments>http://www.sylvainpaillard.com/wordpress/?p=355#comments</comments>
		<pubDate>Mon, 23 Nov 2009 07:21:11 +0000</pubDate>
		<dc:creator>Sylvain Paillard</dc:creator>
		
		<category><![CDATA[customer experience]]></category>

		<category><![CDATA[Dead-Ends]]></category>

		<category><![CDATA[Search & Navigation]]></category>

		<guid isPermaLink="false">http://www.sylvainpaillard.com/wordpress/?p=355</guid>
		<description><![CDATA[One of the key challenges in providing a positive customer experience online is to avoid the so-called &#8220;dead-ends&#8221;, defined as &#8220;the unsuccessful attempts to find a product or information&#8221; by Jacques Nantel, the well-known Professor of the Department of Marketing of HEC Montréal:




When shopping for a product or searching for information on a website, it [...]]]></description>
			<content:encoded><![CDATA[<p>One of the key challenges in providing a positive customer experience online is to avoid the so-called &#8220;dead-ends&#8221;, defined as <em>&#8220;the unsuccessful attempts to find a product or information&#8221;</em> by <a href="http://www.hec.ca/en/profs/jacques.nantel.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.hec.ca');" target="_blank">Jacques Nantel</a>, the well-known Professor of the Department of Marketing of HEC Montréal:</p>
<table border="0">
<tbody>
<tr>
<td width="210"><img class="alignleft size-full wp-image-372" title="deadend21" src="http://www.sylvainpaillard.com/wordpress/wp-content/uploads/2009/11/deadend21.jpg" alt="deadend21" width="201" height="250" /></td>
<td>When shopping for a product or searching for information on a website, it may take several attempts for consumers to find what they are looking for. It is suggested that these unsuccessful attempts to find a product or information, i.e., dead-ends, influence consumers’ perception and evaluation of the website usability. Results of this study, conducted with 204 consumers over two different sites, suggest that there is a negative relationship between the number of dead-ends experienced by consumers while shopping online and the perception of the website usability.  In addition, contrary to popular belief, a positive relationship was found between the number of pages visited by consumers and their evaluation of the website usability. [...]</td>
</tr>
</tbody>
</table>
<p style="text-align: right;"><a href="http://www.chairerbc.com/chairerbc/fichiers/deadends.pdf" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.chairerbc.com');" target="_blank"><em>The influence of Dead-Ends on perceived website usability</em></a></p>
<p>The approach described in the above reference relies on supervised test with over 200 users and is a statistical approach to identify and quantify the dead-ends in a collection of selected web-sites.</p>
<p>I will discuss here some simple methods which can be used to illustrate some Search &amp; Navigation Dead-Ends without any guarantee that they correspond to frequent dead-ends, nor that they represent the majority of the cases, but which do not require any statistical data to identify. The points here-under should therefore be only considered as a checklist to exemplify typical search &amp; navigation dead-ends to open the discussion with a new party on a concrete basis. Advanced statistical tests - like &#8220;Clickstream&#8221; described in the research above - are highly recommended to validate the significance of the cases of these methods.</p>
<p><span id="more-355"></span><strong>Back to the frustration causes:</strong></p>
<p>In my past post <a href="http://www.sylvainpaillard.com/wordpress/?p=302"  target="_blank">&#8220;Causes a frustration in the Search &amp; Navigation Experience&#8221;</a>, I indicated some of the key frustrating factors of the search experience: DOES NOT EXIST, TOO FAR, HIDDEN, UNDEFINABLE, UNDIFFERENTIATABLE, UNCOMPARABLE. These factors correspond to situations where the user feels in - or close to - a dead-end. I provide here-under 10 easy to identify cases where some these situation can happen.</p>
<p>Before starting, let&#8217;s define first the positive use cases with traditional search &amp; navigation today:</p>
<ol>
<li>The search keyword(s) used are represented in all of the desired results textual description exactly as is, and only in all of the desired results textual description exactly as is.</li>
<li>The categories provided bring me (after an intuitive and short sequence of steps) to the information I am interested in.</li>
<li>With some technology, a combination of both these cases (traditionally, the search step first and the category filtering step after).</li>
</ol>
<p><em>NB: A last word foreword: the cases here-under have been made simple and therefore often do not consider combinations. Such combinations are easy to do (e.g.: adding a keyword to the search which doesn&#8217;t change the problem, but make it more complex). In concrete illustration cases, it might be necessary to show combination cases, as customers will tend to defend their current system by saying that it is possible to find a category by one or a few clicks and therefore that the user should not search in these cases. This is often not a valid argument as the fact that people &#8220;shouldn&#8217;t&#8221; search in some cases doesn&#8217;t change the fact that they do, nor that they do not want to read a manual telling them when they should or shouldn&#8217;t search. But in order to simplify the presentation process, a combination case can immediately solve this issue as it is then clear it is not possible to have the same situation simply by browsing in the tree of categories.</em></p>
<p><strong>1. Long list of results<br />
</strong></p>
<p style="padding-left: 30px;"><strong>Pre-conditions: </strong>The user searches for something simple which generates a lot of results.<strong><br />
Symptoms:</strong> The user does not receive any relevant help to reduce this list by making a filtering selection.<strong><br />
Effects:</strong> The user will only look at the first page to see if there is (by chance) exactly what he is looking for and, if not, will either leave the web site or will try another search (refine his search by adding an additional keyword might bring him to the case 4)<strong>.<br />
Frustrations:</strong> TOO FAR, UNDEFINABLE<br />
<strong> How to find the right use cases:</strong><br />
- Search for one word (or one search concept) only<br />
- Search for the name of a category (or sub-category) you know contains a lot of products (tv, fridge, book, etc.)<br />
- Search for a famous brand, a color, or a property common to many products in the database</p>
<p><strong>2. Synonyms</strong></p>
<p style="padding-left: 30px;"><strong>Pre-conditions: </strong>The user searches for something which can be written in different ways.<strong><br />
Symptoms:</strong> The user receives one list of results if he types one word and another list if he types a synonym OR The user receives a list of results in one case and zero results in the other case.<strong><br />
Effects:</strong> The user will not necessarily understand what happens and can come to the conclusion that the product he is looking for is not offered in this web site. If the user does understand, he will be confused on what to do next to make sure he has seen all the possible results and will be frustrated to have to check all the different cases by himself. He will be less likely to make a purchase as he will never be sure to have seen all the possible options (maybe he forgot to try an important synonym). He might also not find the product he is looking for and leave the web site not knowing if<strong> </strong>the product is offered or not.<br />
<strong>Frustrations:</strong> HIDDEN, TOO FAR (not necessarily too many pages, but too many trials to do), DOES NOT EXIST<br />
<strong> How to find the right use cases:</strong><br />
- Close synonyms (e.g. “mobile phone” instead of “cell phone”)<br />
- Distant synonyms (e.g. “optical media” instead of “CD-R”)<br />
- Branded names (e.g. “Wi-Fi” instead of “802.11b”)<br />
- Different spellings (e.g. “ﬁle cabinet” instead of “ﬁling cabinet”)<br />
- Simple plurals (e.g. “screwdrivers” instead of “screwdriver”)<br />
- Complex plurals (e.g. “knives” instead of “knife”)<br />
- Acronyms (e.g. “SLR” instead of “Single Lens Reﬂex”)</p>
<p><strong>3. Misspelled words<br />
</strong></p>
<p style="padding-left: 30px;"><strong>Pre-conditions: </strong>The user searches for something but fails to type it correctly.<strong><br />
Symptoms:</strong> The user receives either no results or the wrong ones and is not properly helped to correct his spelling mistake<strong>.<br />
Effects:</strong> The user might try a few other variants (in the case he does understand that his problem was the spelling); he might try to find the products another way (other search and/or category browsing); but he is very likely to leave the web site without even knowing if the product is offered or not<strong>.<br />
</strong><strong>Frustrations:</strong> HIDDEN, TOO FAR, DOES NOT EXIST<br />
<strong> How to find the right use cases:</strong><br />
- Brand names (e.g. “Motorolla” instead of “Motorola”)<br />
- Category names (e.g. “Pilateese” instead of “Pilates”)<br />
- Product names (e.g. “Airon” instead of “Aeron”)<br />
- Product details (e.g. “Mahagony” instead of “mohagany”)<br />
- Author or artist names (e.g. “Bill Mar” instead of “Bill Maher”)</p>
<p><strong>4. Mixed specifications<br />
</strong></p>
<p style="padding-left: 30px;"><strong>Pre-conditions: </strong>The user searches for more than one criterion. <strong><br />
Symptoms:</strong> The user receives either no results or too many (some web site use an OR filter in such case)<strong>.<br />
Effects: </strong>The user will need to do another search with only one element, which might be a problem as the reason he is using mixed specifications might be because a search with one element generated two many results (loop between point 1 and point 4)<strong>.<br />
</strong><strong>Frustrations:</strong> HIDDEN, TOO FAR<br />
<strong> How to find the right use cases:</strong><br />
- Mixing gender and products (e.g. “Women’s sweaters)<br />
- Mixing details and products (e.g. “Red jacket”)<br />
- Mixing specials and products (e.g. “Skirts on sale”)</p>
<p><strong>5. Too general</strong><strong><br />
</strong></p>
<p style="padding-left: 30px;"><strong>Pre-conditions: </strong>The user searches for something with keywords that are present in many more results than the ones he is interested into<strong>.<br />
Symptoms:</strong> The user receives a very long list of products probably including what he is looking for, but potentially hidden within many irrelevant results. He has no appropriate filtering criteria suggested to him to remove the results he doesn&#8217;t want.<strong><br />
Effects:</strong> The user will need to do another search with one more search keywords (potential loop between point 1 and point 4)<strong>.<br />
</strong><strong>Frustrations:</strong> UNDEFINEABLE<br />
<strong> How to find the right use cases:</strong><br />
- Search for a category name which is present in many contexts (&#8221;accessories&#8221;, &#8220;sony computer&#8221; (might show also all the other computers having a &#8220;sony&#8221; memory card), &#8220;usb&#8221; (will show not only the keys, but also all the computer who have usb), etc.)</p>
<p><strong>6. Incomplete</strong><strong> coverage<br />
</strong></p>
<p style="padding-left: 30px;"><strong>Pre-conditions: </strong>The user searches for something with a specific keyword which is only present in the textual description of a small percentage of the desired results, even if a lot of the other products have it (by default, or written differently)<strong>.<br />
Symptoms: </strong>The user receives a list of results witch only contains some of the results he is looking for. He might come to the conclusion the products doesn&#8217;t exist or will realize he needs to make a more general search and analyse many more pages of results to be sure he has seen all the relevant products.<strong><br />
Effects:</strong> The user will need to do another search with less or other search terms<strong>.<br />
</strong><strong>Frustrations:</strong> HIDDEN, DOES NOT EXIST<br />
<strong> How to find the right use cases:</strong><br />
- Search for specific feature that you have identified is not always properly listed (e.g.: &#8220;HD-READY&#8221;, &#8220;6 channels&#8221;, &#8220;HDMI&#8221;, etc.)<br />
- Search for a specific binary feature (e.g.: if the description of the features contains &#8220;MULTI-LINGUAL : yes&#8221; will not provide the right search behavior unless specific manual configuration has been done)</p>
<p><strong>7. Too complex </strong><strong>guidance<br />
</strong></p>
<p style="padding-left: 30px;"><strong>Pre-conditions: </strong>The user searches for products having hard-to-understand features<strong>.<br />
Symptoms:</strong> The user receives search guidance criteria which are not easy for him to understand as he is not a specialist (e.g.: &#8220;How many Gos do you want for your MP3 player?&#8221;). He has no idea of the meaning of the filtering options he has and chooses one more or less randomly. Unless he is very lucky, he will need to come back<strong>.<br />
Effects:</strong>The user will need to make some guesses (does not make him confident that he is choosing the right product) or will ask for help<strong>.<br />
</strong><strong>Frustrations:</strong> UNDEFINEABLE, UNDIFFERENTIATABLE<br />
<strong> How to find the right use cases:</strong><br />
- Navigate in every category and look at what questions are asked in different context (typically focusing on the domains where some technical knowledge might be useful; e.g.: computers, digital camera, etc.)</p>
<p><strong>8. Parallel search</strong><strong>es<br />
</strong></p>
<p style="padding-left: 30px;"><strong>Pre-conditions: </strong>The user searches through several parallel searches (potentially due to any of the cases above)<strong>.<br />
Symptoms:</strong> The user will either use extensively the back&amp;forward buttons and will often have to redo the same searches (i.e.: the less-technical person) or will open many tabs with all the products he has selected in any of these lists (i.e.: the more technical person). The user will then not be able to use the product comparison feature and will need to do it by switching visually from one case to the other to look at what is different. If more than 2 products are selected, this process is very painful.<strong><br />
Effects:</strong> Unless the user comes to a purchase decision directly, he will not be tempted to come back to the web site for this purchase, as he will remember all the efforts he had to do to have the selected products compared  (and know he will need to do it all again).<strong><br />
</strong><strong>Frustrations:</strong> UNCOMPARABLE<br />
<strong> How to find the right use cases:</strong><br />
- If the web site doesn&#8217;t support multiple selection on the filter choices (results in different searches)<br />
- If the web site doesn&#8217;t propose enough filtering criteria (results in different pages)<br />
- If the web site doesn&#8217;t provide re-definable search criteria (after selecting a few criteria, the user might want to reconsider one to see what are the alternative, if he needs to start a new search, he will need to keep things in parallel)</p>
<p><strong>9. Partially satisfied searches</strong><strong><br />
</strong></p>
<p style="padding-left: 30px;"><strong>Pre-conditions: </strong>The user searches for several criteria which work individually, but not together as no product matches all of them.<strong><br />
Symptoms: </strong>The user will need to make many trials to understand which variations are supplied and which ones are not<strong>.<br />
Effects:</strong> Unless the user is both patient and technically advanced, he will give up. If he doesn&#8217;t, he will still have the parallel search issue at the end as several variants probably exist.<strong><br />
</strong><strong>Frustrations:</strong> UNDEFINEABLE, DOES NOT EXIST<br />
<strong> How to find the right use cases:</strong><br />
- Find a case of at least 3 criteria providing solutions 2 by 2 (at least for two cases), but not all together<br />
- Try to use global attributes (not specific to a category) (e.g.: COLOR (global) + BRAND (global) + CATEGORY (global)  - versus - CATEGORY (global) - SUB-CATEGORY #A (local) - COMPONENT ONLY AVAILABLE IN SUB-CATEGORY #A (even more local))</p>
<p><strong>10. Consequence of setting filters</strong><strong><br />
</strong></p>
<p style="padding-left: 30px;"><strong>Pre-conditions: </strong>The user selects some criteria without realizing that their selections filters out others which are necessary for him.<strong><br />
Symptoms: </strong>The system will remove the criteria which are no more valid and the user will realize that an important option is not offered anymore. The user does not necessarily remember (or simply understands) what action made these choices disappear and might come to the conclusion he selected &#8220;something wrong&#8221;.<br />
<strong> Effects:</strong> The user will need to go back and identify when he defined the search criteria which filtered out the products he really needs and chance his selection at this stage (that is, if he is able to)<strong>.<br />
</strong><strong>Frustrations:</strong> UNDEFINEABLE<br />
<strong> How to find the right use cases:</strong><br />
- Usually, brands have a hidden connection with prices. Selecting some brand often removes some price ranges<br />
- Unusual color or features often make some sub-category, heights, price or weights unavailable</p>
<p><em>NB: Several examples here-above are coming from this quite old, but quite useful and still quite relevant <a href="http://37signals.com/37searchreport.pdf" onclick="javascript:pageTracker._trackPageview('/outbound/article/37signals.com');" target="_blank">study</a>.</em></p>

]]></content:encoded>
			<wfw:commentRss>http://www.sylvainpaillard.com/wordpress/?feed=rss2&amp;p=355</wfw:commentRss>
		</item>
		<item>
		<title>Case Study - Corbeil Electroménagers launches a new E-Commerce web site that innovatively empowers the purchase experience and provides human-like expert guidance for their customers</title>
		<link>http://www.sylvainpaillard.com/wordpress/?p=338</link>
		<comments>http://www.sylvainpaillard.com/wordpress/?p=338#comments</comments>
		<pubDate>Tue, 13 Oct 2009 18:11:36 +0000</pubDate>
		<dc:creator>Sylvain Paillard</dc:creator>
		
		<category><![CDATA[customer experience]]></category>

		<category><![CDATA[E-Commerce]]></category>

		<category><![CDATA[Guidance]]></category>

		<category><![CDATA[Webcom]]></category>

		<guid isPermaLink="false">http://www.sylvainpaillard.com/wordpress/?p=338</guid>
		<description><![CDATA[I have spoken (again) as Keynote Speaker at the Webcom event in Montréal the 22nd of October 2009.
The main goal of this speech was to present a case study explaining how Corbeil Electroménagers has implemented a new CEM approach and technology on their E-Commerce web site (here are the slides I presented).


Keynote introduction:
We will present [...]]]></description>
			<content:encoded><![CDATA[<p>I have spoken (<a href="http://www.sylvainpaillard.com/wordpress/?p=204"  target="_blank">again</a>) as Keynote Speaker at the <a onclick="javascript:pageTracker._trackPageview('/outbound/article/www.webcom-montreal.com');" href="http://www.webcom-montreal.com/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.webcom-montreal.com');" target="_blank">Webcom event</a> in Montréal the 22nd of October 2009.</p>
<p>The main goal of this speech was to present a case study explaining how Corbeil Electroménagers has implemented a new CEM approach and technology on <a href="http://www.corbeilelectro.com" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.corbeilelectro.com');" target="_blank">their E-Commerce web site</a> (<a href="http://www.sylvainpaillard.com/wordpress/wp-content/uploads/2009/10/webcom-guidyu-22-10-09-corbeil.pdf" onclick="javascript:pageTracker._trackPageview('/downloads/wordpress/wp-content/uploads/2009/10/webcom-guidyu-22-10-09-corbeil.pdf');" target="_blank">here</a> are the slides I presented).</p>
<p><a href="http://www.sylvainpaillard.com/wordpress/wp-content/uploads/2009/10/corbeil.jpg"  target="_blank"><img class="alignleft size-medium wp-image-345" title="corbeil" src="http://www.sylvainpaillard.com/wordpress/wp-content/uploads/2009/10/corbeil-300x196.jpg" alt="corbeil" width="300" height="196" /></a></p>
<p><span id="more-338"></span></p>
<p><strong>Keynote introduction:</strong></p>
<p>We will present how Corbeil Electroménagers launched a new innovative technology to improve the customer experience on their E-Commerce web site.</p>
<p>We will demonstrate how we achieved to bring a more human-like online experience, by replacing the traditional trial &amp; errors into an empowering step-by-step guidance.</p>
<p>We will show how we solved the key search and navigation experience challenges at their roots: from suggesting guidance already in the search auto-completion, to helping refining and redefining the search context; from comparing similar products between manufacturers, to transferring qualified requests to the right human expert.</p>
<p>With this new environment, Corbeil has successfully transformed all the experience dead-ends into a continuous sequence of human-like navigation steps, each of which is only one step away from a purchase or contact action.</p>
<p>Finally, the continuous monitoring of the search behaviors and of the satisfaction of the customer intents enable Corbeil to continuously monitor the adaptation of their product offers and stocks status to the dynamic reality of customer demand.</p>
<p>- -</p>
<p>Here is an <a href="http://www.rezopointzero.com/2009/10/24/transformer-la-recherche-de-produits-en-experience-dachat/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.rezopointzero.com');" target="_blank">online press article </a>written about my presentation.</p>
<p>I have also done a <a href="http://www.sylvainpaillard.com/wordpress/wp-content/uploads/2009/10/webcom-guidyu-22-10-09-china.pdf" onclick="javascript:pageTracker._trackPageview('/downloads/wordpress/wp-content/uploads/2009/10/webcom-guidyu-22-10-09-china.pdf');" target="_blank">presentation</a> about doing business in China:</p>
<p><strong>Distributing your software in China: the real opportunities, the real challenges</strong></p>
<p>China&#8217;s market size and growth have everything to blow the most conservative minds, but why so many international firms invest while so few succeeds?</p>
<p>Sylvain Paillard started Guidyu&#8217;s operations in China in 2006 and has been based in Beijing for the last 2 years as General Manager for Greater China. Mr. Paillard manages the local R&amp;D, Sales and Administration teams and successfully secured the investment of Chinese risk capital in the company at the beginning of 2009. Guidyu has established partnerships with the biggest software companies in China and has been able to penetrate various verticals of the domestic industries: from brick and mortar industries like Banks and Telcos to innovative young Internet companies.</p>
<p>Mr. Paillard will visit some of the key aspects enabling a successful distribution of foreign software to and through local Chinese companies and will share some of the main challenges his company faced and how they have been overcome. He will also try to demystify some of the most bias ideas westerners frequently have about what it is to do business in mainland China.</p>

]]></content:encoded>
			<wfw:commentRss>http://www.sylvainpaillard.com/wordpress/?feed=rss2&amp;p=338</wfw:commentRss>
		</item>
		<item>
		<title>Customer Intent Analytics: the weak link between search and purchase</title>
		<link>http://www.sylvainpaillard.com/wordpress/?p=317</link>
		<comments>http://www.sylvainpaillard.com/wordpress/?p=317#comments</comments>
		<pubDate>Fri, 14 Aug 2009 10:41:19 +0000</pubDate>
		<dc:creator>Sylvain Paillard</dc:creator>
		
		<category><![CDATA[customer experience]]></category>

		<category><![CDATA[Customer Intent]]></category>

		<guid isPermaLink="false">http://www.sylvainpaillard.com/wordpress/?p=317</guid>
		<description><![CDATA[Many e-commerce companies have thousands of statistical reports, down from the number of page views and up to the sales trends, but how can they know if the purchase intent of their customers is satisfied properly? And how can they learn to do it better?
While the fundamental goal of most web analytics systems is to [...]]]></description>
			<content:encoded><![CDATA[<p>Many e-commerce companies have thousands of statistical reports, down from the number of page views and up to the sales trends, but how can they know if the purchase intent of their customers is satisfied properly? And how can they learn to do it better?</p>
<p>While the fundamental goal of most web analytics systems is to reports the KPIs (<a href="http://en.wikipedia.org/wiki/Key Performance Indicator" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Key Performance Indicator)</a> having the most effect on sales, these KPIs all tend to be usage driven instead of satisfaction driven.</p>
<p>The CEM (Customer Experience Management) consulting companies tends to supplement this problem by polluting the life of the customers by asking them to answer countless surveys, but fail to deliver precise and actionable conclusions, especially on the per case basis.</p>
<p>How can you capture your customer intents in order to see how well your product offering is adapted to it and presented properly?</p>
<p><img class="alignleft size-full wp-image-320" title="intent2" src="http://www.sylvainpaillard.com/wordpress/wp-content/uploads/2009/08/intent2.png" alt="intent2" width="415" height="288" /></p>
<p><em>Understanding the key parameters of customer intents for products which are not easy to find or not in the database is the key to ensure the satisfaction of all the significant purchase intents.</em></p>
<p><span id="more-317"></span></p>
<p>Statistical information like product page views and purchase counts give valuable information about the products matching the most common customers’ intents of purchase, but they do not provide any information about why the purchase happened, or help identifying missing product lines or products which are hard to find.</p>
<p>As the entire structure of e-commerce web sites is build around its existing content, the only real source of information of unsatisfied intents are the search logs, but it would be a mistake to consider them as a perfect representation, notably for the following reasons:</p>
<p style="padding-left: 30px;"><span style="text-decoration: underline;">1. Intents are flexible; search queries are not:</span></p>
<p style="padding-left: 30px;">Customers traditionally have a certain flexibility in the definition of their intents, which is not well adapted to the rigidity of the search query. As a result, the search queries tend to be either to general or too specific compared to the real intent.</p>
<p style="padding-left: 30px;"><em>For example, a customer searches first for “shampoo”, while he is looking for a shampoo of a high class brand, and will then do a new search with “shampoo l’oréal” due to too big amount of results provided by the first trial.</em><br />
<em><br />
</em><span style="text-decoration: underline;">2. Intents are subjective; search queries are not:</span></p>
<p style="padding-left: 30px;">Only the most objective parts of the customer intent are usually expressed, the customer will then evaluate if products matching his intent are in the list or not, but will never inform the system of what the intent really was (whether it is matched or not).</p>
<p style="padding-left: 30px;"><em>For example, a customer searches for “shampoo”, while he actually intent in buying an expensive shampoo for his wife. Only the objective part remains.</em></p>
<p style="padding-left: 30px;"><span style="text-decoration: underline;">3. Intents are personal; search queries are not:</span></p>
<p style="padding-left: 30px;">An intent is very comparable to a motivation: there are many different intents bringing to the same product, but for different reasons. Due to the impossibility to express an intent freely as a search query, only the people able to formalize objective and impersonal search queries will use the search tools enough to provide statistically relevant information.</p>
<p style="padding-left: 30px;"><em>For example, a customer who is looking for a shampoo to protect his sensitive hair and likes ones with tropical fruit smell will probably not know how to formalize his search query, but if he saw a commercial of it and remember the product name, he will simply type it.</em></p>
<p>Another critical problem of analysing and understanding the customer intent only with search logs and category access statistics is to identify the unsatisfied intent. First, many intent will not be expressed (as discussed above), and the only simple case which can be managed easily is the identification of the search queries providing no results, however, this approach has a very limited potential, partially for the following reasons:</p>
<p style="padding-left: 30px;"><span style="text-decoration: underline;">Many searches provide results, but many do not include any relevant result<br />
</span></p>
<p style="padding-left: 30px;">As the amount of textual information about products (description, features, reviews, comments, etc.) tends to become more and more important, it starts to be increasingly uncommon to face the zero results page. To some degree, one could argue that it is better, as showing a list of items which are at least partially relevant is better than no list at all. However, it is not the same from the side of the web analytics where we would like to differentiate the cases of success and failures.</p>
<p style="padding-left: 30px;"><em>For example, a customer searches for “l’oréal conditioner for sensitive hair”, but none of the products of the database matches this intent. The search will stil provide results with L’oréal conditioners as the others words can be found from the textual descriptions and the customer reviews (even if in a negative sense, e.g.: “not for sensitive hair”). It is then almost impossible for the company to identify that this request was not properly answered, even if it provided results.<br />
</em></p>
<p style="padding-left: 30px;">Unfortunately, differentiating the searches bringing sales from the others cannot help either, as the one bringing sales are very clean, but not helpful, and the others tends to be extremely noisy and most of the time unusable (as many search queries providing valuable results will be there as well).</p>
<p>In this post, I try to demonstrate that the need of a new understanding level of the search queries is not only beneficial for the end-customer (as discussed in prior and probably future posts), but is absolutely necessary to report and analyse customer intents properly.</p>
<p>In this process, two components are critical for the success:</p>
<p>1.    Have a technology which is able to capture the customer intent in a structured way<br />
2.    Educate the customers so they know they can express their intent freely and should expect the web site to answer them better over time due to the intent analysis and optimization outcomes</p>
<p><em>PS: All the examples are coming from a current case I am working on (<a href="http://www.perfecthair.ch" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.perfecthair.ch');" target="_blank">www.perfecthair.ch</a>), so don’t worry, it’s not that I am trying to sell shampoo, it’s just my current work which is a good source of examples and a motivation for this post <img src='http://www.sylvainpaillard.com/wordpress/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </em></p>

]]></content:encoded>
			<wfw:commentRss>http://www.sylvainpaillard.com/wordpress/?feed=rss2&amp;p=317</wfw:commentRss>
		</item>
		<item>
		<title>Causes of Frustration in the Search &#038; Navigation Experience</title>
		<link>http://www.sylvainpaillard.com/wordpress/?p=302</link>
		<comments>http://www.sylvainpaillard.com/wordpress/?p=302#comments</comments>
		<pubDate>Mon, 13 Jul 2009 09:54:22 +0000</pubDate>
		<dc:creator>Sylvain Paillard</dc:creator>
		
		<category><![CDATA[customer experience]]></category>

		<category><![CDATA[Search & Navigation]]></category>

		<guid isPermaLink="false">http://www.sylvainpaillard.com/wordpress/?p=302</guid>
		<description><![CDATA[On the web, regardless of the reason why we can&#8217;t find what we are looking for, our emotion is pretty much the same: we get frustrated, more and more rapidly over time.
As a result, two potential reaction are usually adopted:

I prefer not receiving these services anymore until there is something better out there
I will use [...]]]></description>
			<content:encoded><![CDATA[<p>On the web, regardless of the reason why we can&#8217;t find what we are looking for, our emotion is pretty much the same: we get frustrated, more and more rapidly over time.</p>
<p>As a result, two potential reaction are usually adopted:</p>
<ul>
<li>I prefer not receiving these services anymore until there is something better out there</li>
<li>I will use it for now, but I will definitely quit as soon as there is something better out there</li>
</ul>
<p>The type of reaction is usually more linked to the need we have for the service than our level of frustration (i.e.: we can be extremely frustrated, but continue to force ourselves to continue if we really need to find the information, while a slightly disengaging experience turns us off quickly if the service is not critical for us).</p>
<p>In this post, I try to identify the sources of these frustrating experiences when related to search &amp; navigation  and discuss the ways they can be solved.</p>
<p><img class="alignleft size-full wp-image-303" title="bad_ecommerce_experience" src="http://www.sylvainpaillard.com/wordpress/wp-content/uploads/2009/07/bad_ecommerce_experience.jpg" alt="bad_ecommerce_experience" width="425" height="343" /></p>
<p><em>As we can see in this diagram, a negative experience can be worse than no experience at all!</em></p>
<p><em><span id="more-302"></span></em>Defining properly the real cases of frustration of the online experiences is a much more complex task than it seems. Indeed, a definition like &#8220;I can&#8217;t find what I am looking for&#8221; is not quite precise enough to come up with appropriate solutions.</p>
<p>Here are the most common cases as I see them, which I divided in two groups:</p>
<p><span style="text-decoration: underline;">A: I know what I want:</span></p>
<ul>
<li>DOES NOT EXIST: The information I am looking does not exist on the web site.</li>
<li>TOO FAR: The information I am looking for is in a list of many results, but I don&#8217;t know where</li>
<li>HIDDEN: The information I am looking for exists but is not included in the result lists of my searches/navigation</li>
</ul>
<p><span style="text-decoration: underline;">B: I don&#8217;t know what I want (at least not exactly):</span></p>
<ul>
<li>All the points of A</li>
<li>UNDEFINABLE: I don&#8217;t know how to define what I am looking for (maybe it is there, maybe not)</li>
<li>UNDIFFERENTIATABLE: I don&#8217;t know how to identify the information which I want out of others which seem fully similar</li>
<li>UNCOMPARABLE: I found something, but it is impossible (too hard/long) to make sure it is the most adapted option for me</li>
</ul>
<p>Many other points could be added in consideration of the reasons people decide not to buy on e-commerce portals, but I will only focus here on the frustration of the user, not of the seller <img src='http://www.sylvainpaillard.com/wordpress/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
<p>Also, other points could be related to the frustration of exploring the content of the information (not sufficient, too complex, too long, etc.), but I will only focus here on the information access problematic.</p>
<p>We can see that the experience of someone who does not know what he is looking for exactly is much more likely to be frustrating than the one of someone who does (which is quite intuitive). This explains to some degree why it is harder to deliver a good online customer experience in a Gift Shop than in a Office Supply Store.</p>
<p><span style="text-decoration: underline;">DOES NOT EXIST:</span></p>
<p>This is probably one of today&#8217;s biggest differences between the online and the offline world. When you go to an electronics store and ask for a well known product like the IPhone, they will immediately let you know if they don&#8217;t sell it (which is often the case, as the distributor are mainly the telco operators).</p>
<p>Strangely, when you go on stores like Amazon, you have no indications that the products does not exist after searching for it and you might spend (like me) up to 30 minutes until realizing that they actually don&#8217;t sell it. You will even find yourself encouraged to do so by the auto-completion search suggestions which invites you to type: &#8220;Iphone&#8221;, &#8220;Iphone 3G&#8221;, &#8220;IPhone 3G unlocked&#8221;, &#8230;; clearly misleading as it strongly imply that the Iphone is a well-known product in their offer.</p>
<p>Identifying that the customer is looking for something which you don&#8217;t have (or simply that you have, but not as part of your online collection) is a crucial part of providing a positive customer experience. Inviting the customer to be contacted when the situation changes is also a good idea (by e-mail, or by calling him).</p>
<p><span style="text-decoration: underline;">TOO FAR - HIDDEN: The great pleasures of trial &amp; errors:</span></p>
<p>Another way to explain this issue is UNDERDEFINED - OVERDEFINED:</p>
<blockquote><p>PC under 500$ : 3&#8242;000.- results</p>
<p>with bluray burner : 0 results</p>
<p>with a hard drive bigger than 50 Go : 2&#8242;950 results</p>
<p>with &gt; 19&#8221; screen : 2 results I don&#8217;t like</p>
<p>with CPU &gt; 2Ghz : 2&#8242;850 results</p></blockquote>
<p>This is a pretty understandable problem, as most manufacturers focus on the biggest demand and therefore the most needed features are present in almost every models. Many other aspects differentiate them, but most of them are not part of the important need of the customer (for the exact same reason). Therefore, the TOO FAR - HIDDEN scenario can last for a long time, and the user will need to open many tabs on his browser not to loose the products he wants to consider. He will need to switch from one to the other to have a way to compare (not exactly what was intended by the &#8220;compare&#8221; features) and will lose everything when he exits his browser.</p>
<p>Making sure the user can have an efficient view on the set of parameters which matters for him is key to avoid this vicious frustration cycle.</p>
<p>Depending on the case, this selection can be done explicitly by the user (which brings a certain amount of complexity in the user interface) or induced more intelligently in the back (reducing the freedom of change of the user). There is no magic formula, but there is definitely a need.</p>
<p><span style="text-decoration: underline;">UNDEFINABLE:</span></p>
<p>Try to imagine your mother buying a computer online and you will have a good picture of what undefinable means: while certain people will be able to know what to ask for in terms of CPU, Memory, or Hard drive, defining these parameters can be mission impossibles for others. Their requests are much more subjective and define the usage they want to have with the device, instead of the required properties it must have.</p>
<p>The typical approach nowadays to solve this challenge is to force customers through a predefined path providing simple to understand explanation (e.g.: &#8220;What hard drive do you want? <em>A hard drive is where you can store all your data (pictures, documents, etc.)</em>&#8220;,[ "100 Go (up to 30'000 songs or 100 movies)", "250 Go (up to 75'000 songs or 300 movies)", "500 Go (up to 150'000 songs or 500 movies)"]).</p>
<p>However, a better way would be to provide a completely different type of search criteria to the users who need them, with parameters like the desired usage: office work, image editing, video montage, 3D gaming, etc.</p>
<p>Adding subjective confirmation on the product can confirm the impression of having found the right ones (e.g.: &#8220;This model is perfectly suited if you look for a entry price desktop computer to do office work&#8221;, &#8220;This model is powerful enough to edit all your family videos with great visual effects like in the movies!&#8221;, &#8230;)</p>
<p><span style="text-decoration: underline;">UNDIFFERENTIATABLE:</span></p>
<p>There is nothing more frustrating than to have the feeling that 3 products are just exactly the same. In some cases, increasing the size of the database in order to offer a bigger product line can backfire for the customer experience. Customers had no problems to find the right products in the past and are now confronted with big lists of products which all look similar.</p>
<p>This problem can also give the customer the impression that the provider is not able to select the good product offering for him and just pushes everything to make more money. In such case, the price tends to be perceived as the only competitive value (versus the expertise of choosing the right products for the customer target segment).</p>
<p>The best way to solve this problem is to complete the product properties to provide the final differences (and possibly to invite the customer to narrow down his selection with these new criteria).</p>
<p>Another common approach is to provide CGC (Customer Generated Content), not requiring any manual effort on the side of the company and showing an &#8220;objective&#8221; evaluation of the product to let the customer increase its confidence to identify the right product out of a small selection.</p>
<p>On the other hand, it has the limitation to be very general (if 50% of the people like the products and 50% don&#8217;t like it, the ranking will not give anything relevant), and to push a potentially heavy analysis work on the side of the customer (having to read hundreds of reviews, most of them providing no interesting content). On top of this, reviews can contain information implying a bad after sales experience with the provider, which might be difficult to find and moderate.</p>
<p>The most sophisticated approach I have seen so far is CGC over CGC (or like I like to call it: CGC²): let the customers vote for the most useful customer reviews (the one saying the product is good and the other one that it is not). It is then likely that customers will limit their reading to these and it enable companies to only moderate a small percentage of the total amount of reviews.</p>
<p><span style="text-decoration: underline;">UNCOMPARABLE:</span></p>
<p>The problems related to comparing products are quite similar to the one of differentiating a small selection of them. However, the experience (and frustrations) can be quite different. In that way, the comparability is more tricky in terms of customer experience than the problem of differentiation.</p>
<ul>
<li>If the customer finds 3 products and needs to differentiate them to identify the best one, the coverage is extremely simple: it&#8217;s these 3 products.</li>
</ul>
<ul>
<li>If the customer finds one product he likes, but want to see if there is anything else he should consider, this coverage is fully open-ended.</li>
</ul>
<p>Unlike what some providers might think, this problem is not solved by pushing more compelling marketing message on the product detail page, as the challenge is not to be convince of the product value, but to remove the fear that there might be something better which the customer didn&#8217;t find so far.</p>
<p>The reason why this aspect is extremely important in E-Commerce is that if the customer is not sure that the site has showed him all the products he should consider, he will be likely to look for other possibilites on other web sites. In that logic, ensuring a good comparison experience between the different products can lower the need to compare with other providers.</p>
<p>There are several approaches to help the customer find other products for comparison purposes:</p>
<ul>
<li>Prepare a list of similar products for each product
<ul>
<li>advantage: no action required from the user</li>
<li>disadvantage: customer has no idea how this list was computed and might not feel it is complete nor adequate for him</li>
</ul>
</li>
<li>Provide dynamic search filters (with counters)
<ul>
<li>advantage: customer can easily decide which parameters are important to keep and which ones are not (as he can decide which filter he wishes to modify). Counters are important as many combination can lead to zero results and should not have to be tried.</li>
<li>disadvantage: none really, except it requires purchasing advanced technologies</li>
</ul>
</li>
<li>Escalate to the contact center
<ul>
<li>advantage: the customer might just need to hear from a real person that he is making a good choice</li>
<li>disadvantage: none really, except the cost of the calls and that not everyone will want to chose this option</li>
</ul>
</li>
</ul>
<p>Comparison is one of the most important issues in the e-commerce world today. The focus of the industry has been put on the meta-portal showing the difference of price between providers for the same product, but not in finding the adequate products to compare to within the product offering of one provider. This is another example of how the Internet world has accomplished things that were not possible in the brick &amp; mortar world, but totally left aside other aspects which have been  proven extremely important in the real world, for the reason that nobody really knows how to handle it properly.</p>
<p><strong>Conclusion:</strong></p>
<p>This post is not the final conclusion on this topic (far from it) and, if anything, it is more intended to show where the issues are coming from and to group them into a short 6 points list. I will elaborate more on the possible solution CEM (Customer Experience Management) can provide in terms of Information Access for each one of them more in depth in the future, but I hope you can already find the above useful and understand a little better what is at stake when someone tells you that you have customer experience issues with your web site.</p>

]]></content:encoded>
			<wfw:commentRss>http://www.sylvainpaillard.com/wordpress/?feed=rss2&amp;p=302</wfw:commentRss>
		</item>
		<item>
		<title>The tumultuous history of Dialog Systems</title>
		<link>http://www.sylvainpaillard.com/wordpress/?p=216</link>
		<comments>http://www.sylvainpaillard.com/wordpress/?p=216#comments</comments>
		<pubDate>Tue, 09 Jun 2009 06:56:39 +0000</pubDate>
		<dc:creator>Sylvain Paillard</dc:creator>
		
		<category><![CDATA[artificial intelligence]]></category>

		<category><![CDATA[Dialog systems]]></category>

		<guid isPermaLink="false">http://www.sylvainpaillard.com/wordpress/?p=216</guid>
		<description><![CDATA[Chinese version - 中文
The idea of a Dialog System is probably as old as the field of computer science itself.  It is hard to know if Charles Babbage already thought about it in the 1830s when he created his  Analytical Engine and then his Difference Engine; but it is clear that Alan Turing set the [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.sylvainpaillard.com/wordpress/?page_id=284" ><em>Chinese version </em><span style="font-family: Arial;">- 中文</span><em></em></a></p>
<p>The idea of a <a href="http://en.wikipedia.org/wiki/Dialog System" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Dialog System</a> is probably as old as the field of computer science itself.  It is hard to know if <a href="http://en.wikipedia.org/wiki/Charles Babbage" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Charles Babbage</a> already thought about it in the 1830s when he created his  <a href="http://en.wikipedia.org/wiki/Analytical Engine" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Analytical Engine</a> and then his <a href="http://en.wikipedia.org/wiki/Difference Engine" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Difference Engine;</a> but it is clear that <a href="http://en.wikipedia.org/wiki/Alan Turing" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Alan Turing</a> set the definition of the ultimate Dialog System when he described the <a href="http://en.wikipedia.org/wiki/Turing Test" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Turing Test</a> in his paper <a href="http://en.wikipedia.org/wiki/Computing Machinery and Intelligence" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Computing Machinery and Intelligence</a> in 1950.<br />
<img class="alignleft size-full wp-image-222" title="turing_test_version_3" src="http://www.sylvainpaillard.com/wordpress/wp-content/uploads/2009/06/turing_test_version_3.png" alt="turing_test_version_3" width="250" height="320" /><br />
<a href="http://en.wikipedia.org/wiki/File:Turing_Test_version_3.png" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target="_blank"><em>From Wikipedia</em></a><em> -   The &#8220;standard interpretation&#8221; of the Turing Test, in which player C, the interrogator, is tasked with trying to determine which player - A or B - is a computer and which is a human. The interrogator is limited to only using the responses to written questions in order to make the determination.</em></p>
<p>Turing predicted that machines would eventually be able to pass the test and that 30% of human judges would be fooled in a five-minute test by the year 2000.  Futurist <a href="http://en.wikipedia.org/wiki/Raymond Kurzweil" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Raymond Kurzweil</a> updated it to 2020 in 1990 and revised it to 2029 in 2005.</p>
<p>This last prediction appears to me as uncertain as any of the prior ones, but many interesting Dialog Systems have been developed already and, thankfully, the market does not need the Turing Test to be passed to start adopting them.</p>
<p><span class="summary" style="display: none;">Turing Test version 3.png</span></p>
<p><span id="more-216"></span><strong>The fundamental difference between Chatterbots and Dialog Management:</strong></p>
<p>Before providing some elements about the history of Dialog Systems since 1950, it is important to consider that two different trends have been pursued throughout the past decades: the one of simulating a dialog in appearance (which I will call the Chatterbot approach) and the one of modelling a real understanding of the dialog and generating the appropriate answers dynamically (which I will call the Dialog Management approach). We can find the reason for this co-existence directly in the definition of the Turing Test which considers only the impression of validity of the answers provided and not any other sort of proof  of understanding.</p>
<p>In the reality, the developed system sometimes mix both approaches, but one almost always has a clear predominance on the other (and to some degree, one can claim that a Chatterbot has a Dialog Manager inside, even if it usually based on simple pattern matching rules). Let me give an simple example to compare what I mean:</p>
<p><em>The user asks: &#8220;Can you buy me a bottle of milk?&#8221;</em></p>
<p><em>In a Dialog Management approach, the computer could (but this is just an example), build an model of this kind: [type:Question; action:buy;interrogation:ability to perform action;object:bottle of milk] on which it would query what would have to be a fairly complex knowledge reference-base to answer, for instance, &#8220;no, because I have no money&#8221;, or possibly to ask a question: &#8220;It depends, can you give me some money?&#8221;.<br />
</em></p>
<p><em>In a Chatterbot approach, this internal processing doesn&#8217;t exist and a predefined answer is selected through quite simple rules (&#8221;Can you*?&#8221; =&gt; list ["yes, of course", "no, i can't", "no, I don't want to"]). The answer is often randomly selected in the list. The answer can appear to make sense, however, the system has no real understanding of the question, he just fakes to be able to perform a dialog.</em></p>
<p>It is easy to understand that the Chatterbot approach simply doesn&#8217;t make any sense if the goal is to perform a real action and not just to provide an answer (how could a system perform any meaningful action by faking the understanding of the requests of the users?). In my view, it doesn&#8217;t make sense when it comes to textual conversation either, because the fundamental limitations of Chatterbots are too great to provide any sustainable value, even when only answering questions.</p>
<p>However, the &#8220;wow&#8221; affect of good demo cases is so great that, repetitively during the last 50 years, many people have been duped  by the false impression that a free dialog system could work efficiently with a Chatterbot approach. Billions of dollars have been spent in pure vain and, to my great desperation, I predict more will be spent in the future, until the Dialog Management provides sufficient results to simply eradicate this this shameful error in the evolution of computer science.</p>
<p><strong>The history of Chatterbots</strong><strong>:</strong></p>
<p>Everything really started in the 60s, when <a href="http://en.wikipedia.org/wiki/Joseph Weizenbaum" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Joseph Weizenbaum</a> developed <a href="http://en.wikipedia.org/wiki/Eliza" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Eliza</a> in the MIT, which is commonly referred to as the first Chatterbot. The most famous program of Eliza was the DOCTOR script, which provided a &#8220;parody&#8221; of the responses of a non-directional psychotherapist in an initial psychiatric interview. The irony was that, even if Weizenbaum believed the system had a great interest due to the emotional reaction it created on people, he never really considered it as a solid base for more intelligent systems.  Even more so, the multitude of meaningless discussions and wrong conclusions Eliza created in the society pushed him to write the book <em>Computer Power and Human Reason: From Judgment to Calculation</em>, which argued that the misuse of artificial intelligence has the potential to devalue human life. But people preferred to play with Eliza rather than to read his book and, as in Pygmalion (from which, due to another irony, he chose the name Eliza), he had no control on his creation. If people wanted to believe in the potential of a Chatterbot, even without any scientific justification, it couldn&#8217;t be stopped. 50 years later, as I write this post, many companies tried to convince people to buy pattern matching-based systems, even if many historical cases should remind us they are doomed to fail miserably.</p>
<p>6 years later, in 1972, psychiatrist <a href="http://en.wikipedia.org/wiki/Kenneth Colby" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Kenneth Colby</a> created <a href="http://en.wikipedia.org/wiki/PARRY" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>PARRY</a> at Stanford University. The basis were the same as for Eliza, but Parry tended to simulate a paranoid schizophrenic, instead of a psychotherapist. Even if Colby made a much more serious effort, the result was fully similar. As you can imagine, the connection between Eliza (the psychotherapist) and Parry (the schizophrenic) was inevitable, <a href="http://tools.ietf.org/html/rfc439" onclick="javascript:pageTracker._trackPageview('/outbound/article/tools.ietf.org');" target="_blank">here is the result of their meeting at the ICCC in 1972</a>. As it could have been guessed, connecting two stupid Chatterbots together didn&#8217;t result in anything great&#8230;</p>
<p>The field didn&#8217;t get more serious with <a href="http://en.wikipedia.org/wiki/Racter" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Racter,</a> and <em>The Policeman&#8217;s Beard Is Half Constructed</em>, a book the creators, <a href="http://en.wikipedia.org/wiki/William Chamberlain" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>William Chamberlain</a> and <a href="http://en.wikipedia.org/wiki/Thomas Etter" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Thomas Etter,</a> pretended had been fully written by Racter. The result was impressive, but the program was never released to the general public. One year later, when <a href="http://en.wikipedia.org/wiki/Mindscape" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Mindscape</a> released a Chatterbot version of Ractor, it became clear for everyone that Ractor was far less sophisticated than anything that could have written the fairly prose of <em>The Policeman&#8217;s Beard</em>. The story is still not very clear today, but it seems obvious Chamberlain and Etter created huge data files containing most of the text of the book, which Ractor just &#8220;joint&#8221; together.</p>
<p>As early as 1978, Michael Mauldin was one of the first person who tried to bring some reasoning into a Chatterbot, when he created PET, a Chatterbot able to posit new information and became famous for the following Dialog:</p>
<p><em>Subject:                I like my friend<br />
(later)<br />
Subject:                I like food.<br />
PET:                       I have heard that food is your friend.<br />
</em></p>
<p>Then, Mauldin created Virtual Personalities (now <a href="http://www.conversive.com/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.conversive.com');" target="_blank">Conversive</a>) and two famous Chatterbots: Sylvie (1994) and Julia (1997).  The key aspects provided by Maulding and Peter Plantec (the other founder) was, in addition to provide incorporated animation and synthetic voice, the ability to explore a virtual world (e.g.: a web site) and use the gathered information in a dialog. In this sense, Mauldin try to open a way out of the pattern matching approach, realizing the knowledge had to come from outside and couldn&#8217;t be a pre-formatted data file as for ELIZA. Another interesting aspect is that Mauldin is also the founder of Lycos, a search engine which was initially an extrapolation of Julia. Mauldin is also the inventor of the term <a href="http://en.wikipedia.org/wiki/ChatterBot" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>ChatterBot</a> in 1994 (as a synonym for Artificial Conversational Entity (ACE))</p>
<p>In 1990, the <a href="http://en.wikipedia.org/wiki/Loebner Prize" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Loebner Prize</a> contest was created as an annual competition in artificial intelligence that awards prizes to the <a href="http://en.wikipedia.org/wiki/Chatterbot" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Chatterbot</a> considered by the judges to be the most human-like, following the same format as the Turing test. The Loebner Prize does not require the Dialog Systems to be based on a pattern matching approach, and therefore, the day reasoning based system will work, they will be able to prove their ability. However, this contest do not reward the sophistication of the approach, but only the result, by following casual chatting scripts and evaluating the relevance of the result. As an effect, systems providing quick results are destined to be rewarded, rather than more serious efforts which would try to solve one small aspect at a time.</p>
<p>Another significant player in the Chatterbot history is <a href="http://en.wikipedia.org/wiki/Richard Wallace" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Richard Wallace,</a> the founder of  A.L.I.C.E. (Artificial Linguistic Internet Computer Entity). Wallace took a different approach which was quite rewarded as Alice won the Loebner Prize 3 times (2000, 2001 and 2004). His approach went back to a purely pattern matching one, but he created an XML Schema called <a href="http://en.wikipedia.org/wiki/AIML" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>AIML</a> (Artificial Intelligence Markup Language) for specifying the heuristic conversation rules. The advantage of this approach was that it was easy to create and share knowledge in an AIML file, as well as to load many AIML files together to have a &#8220;smarter&#8221; bot.</p>
<p>My opinion is that all these efforts on Chatterbots based on pattern matching are a monumental waste of time and money (as we will see here-after, to the level of billions of dollars). You don&#8217;t believe me? <a href="http://www.chayden.net/eliza/Eliza.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.chayden.net');" target="_blank">try Eliza</a> and compare it with 2008 <a href="http://en.wikipedia.org/wiki/Loebner Prize" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Loebner Prize</a> winner <a href="http://www.elbot.com/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.elbot.com');" target="_blank">Elbot</a>. Tell me how you really believe that these 50 years of efforts were worth it. Were we really digging at the right place?</p>
<p><strong>Chatterbots in the business world:</strong></p>
<blockquote><p>The average lifetime of commercially employed chatterbots is restricted to only 6 month.<br />
<em>Forrester Research</em></p></blockquote>
<p>One of the most fascinating story related to how large companies believed in the potential of Chatterbots is the case of <a href="http://en.wikipedia.org/wiki/Artificial Life" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>Artificial Life,</a> a company founded in 1994 which was able to sell custom-made Chatterbots applications to companies like Credit Suisse First Boston, Price Waterhouse Coopers and UBS. The company still exists, and is actually doing quite well, but now in a totally different field (mobile gaming), as, after the Internet bubble burst, they lost more or less all their market.</p>
<p>What is interesting is that the company was able to become public on the NASDAQ (<a href="http://finance.yahoo.com/echarts?s=ALIF.OB#symbol=ALIF.OB;range=my" onclick="javascript:pageTracker._trackPageview('/outbound/article/finance.yahoo.com');" target="_blank">ALIF</a>)  in 1998 and the market capitalization of this company reached a stock value of over $38 in February 2000. In June 2003, the stock was only worth $0.05, or 760 times less 3 years before. What is interesting is that the market capitalization when the stock had a value of $38 was over 1.8 billion USD. This value was less than 2.5 million USD 3 years later.</p>
<p>Artificial life is not the only case, but it is, to my knowledge, the biggest one ever. All this being said, I have to express my highest admiration to their executive team, and more particularly to their founder, Eberhard Schoneburg, who is still their current CEO, not only for having created such an amazing value in the domain of Chatterbot (even if for a short time), but more especially for succeeding in turning the company around, moving it to Hong Kong and having the second highest mobile penetration rate in the world.</p>
<p>Another famous example is the company Ask Jeeves (now Ask.com), who was able to convince Dell to adopt &#8220;Ask Dudley&#8221; for the online technical support in 1998. Ask Jeeves capitalized quite strongly on its natural language capacities with this Chatterbot-based technology and was able to grow quite well until 2000, reaching $58 million in sales. From a high of $190 per share in 1999, the company&#8217;s stock began spiraling downward, falling to just $.86 per share by 2002. Stuck with a technology that simply didn&#8217;t have what was necessary to perform properly, Ask Jeeves found a way out by purchasing a search engine company called Teoma Technologies. In 2005, the company announced plans to phase out Jeeves. On February 27, 2006 the character disappeared from Ask.com.</p>
<p>However, both these cases did show pretty strong sales successes compared to the average case in the world of Chatterbot as, in many cases, only grants funding are able to justify the costs of their implementation:</p>
<blockquote><p>&#8220;Most of the German bots were build with funds from grants.&#8221;<br />
<em>A Trend from Germany: Library Chatbots in Digital Reference</em></p></blockquote>
<p>The Chatterbot technology is inherently stunted in it’s evolution and it’s effectiveness due to the approach and fundamental basis of Chatterbot technologies idea of pattern matching… The fact that there is a completely faked but nevertheless appearance of some form of Artificial Intelligence to the user may make for a sexy sale… but the smoke and mirrors approach in offering customers a new ‘feature’ with very limited value, if any at all, has been historically proven that the road to failure is a short one…</p>
<p><strong>The history of Dialog Management</strong><strong>:</strong></p>
<p>Dialog Systems based on Reasoning, in opposision to Chatterbots, try to do much less, but with much more control. As a result, their implementations are usually quite focused to a specific domain requiring specific actions. Even if the history of Dialog Management has not been exposed to the market as widely as Chatterbots, they also have an interesting history.</p>
<p>The first reference I found of Dialog Management being really used is in 1986, in the article &#8220;Dialog management for gestural interfaces&#8221; of IBM. Of course, many efforts have been done before, but not really on a separated module defined as a Dialog Manager (or at least, not to my knowledge).</p>
<p><a href="http://en.wikipedia.org/wiki/ Carnegie Mellon University" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'> Carnegie Mellon University</a> (CMU) is probably one of the research center which has been the most active in Dialog Management during the last 20 years, especially since the AGENDA dialog manager of Wu &amp; Rudnicky in 1999. In 2003, Bohus &amp; Rudnicky created the RavenClaw, which is now the standard Dialog Manager of the <a href="http://wiki.speech.cs.cmu.edu/olympus/index.php/Olympus" onclick="javascript:pageTracker._trackPageview('/outbound/article/wiki.speech.cs.cmu.edu');" target="_blank">Olympus Dialog System Framework</a>, the CMU architecture for spoken dialog system.</p>
<p>Such architecture already show very impressive results, not only limited to the scope of the Dialog Manager, but throughout the entire flow of the Dialog System (speech recognition, natural language processing, dialog management, output generation and text to speech). I am personally particularly impressed by the <a href="http://www.cs.cmu.edu/~dbohus/ravenclaw-olympus/roomline.html" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.cs.cmu.edu');" target="_blank">RoomLine application</a> which show, in my opinion, already a great business potential yet not exploited at all by the market.</p>
<p><strong>Dialog Management in the Business world:</strong></p>
<p>While Chatterbots found their place in casual textual chatting, Dialog Managers tended to penetrate the vocal environment, but first, we needed a standard: AT&amp;T, IBM, Lucent, and Motorola formed the <a href="http://en.wikipedia.org/wiki/VoiceXML" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target='_blank'>VoiceXML</a> Forum in March 1999, in order to develop a standard markup language for specifying voice dialogs. They published the VocieXML 0.9 standard the same year and the version 1.0 in 2000, followed by the version 2.0 in 2003.</p>
<p>In that light, the field has been heavily pushed in the direction of speech recognition and some very large company were created, like <a href="http://www.nuance.com" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.nuance.com');" target="_blank">Nuance</a>, the worldwide leader. The company was founded in 1992 and has now a market capitalization of 3.5 billion USD. Even if most of their products are related to speech recognition and document management, their <a href="http://www.nuance.com/speech/foundation/dialog/" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.nuance.com');" target="_blank">products line based on dialog management</a> is a valuable and growing part of their revenue.</p>
<p><strong>The future:</strong></p>
<p>Nobody knows how long Chatterbots based on pattern matching will find their place in ther market world and how many cases will be necessary for the market to finally understand the limitation of this approach.</p>
<p>On the other side, the hype of Dialog Management is still to come and it is to hope that they will go to the same heights than Chatterbots. We can already see the accomplishment of their early potential in the impressive works done at CMU and it is, in my opinion, just a matter of time until these technologies hit the market properly.</p>
<p>The interesting thing with the hype driven by Chatterbots, is that there is a clear desire especially in our current market context in having machines understand us.  The key lies not just in spewing out a response but rather in having computers ‘understand’ customers and interpret the understanding based on learned ‘experience’.  This in my view is where Dialog Management is taking us, with this approach we can see the basis for evolution, even organically, learning and adapting to ‘experience’ … while enabling technology to deliver the ‘understanding’, guidance and result we are all looking for.</p>

]]></content:encoded>
			<wfw:commentRss>http://www.sylvainpaillard.com/wordpress/?feed=rss2&amp;p=216</wfw:commentRss>
		</item>
		<item>
		<title>How artificial intelligence will revolutionize the customer experience and empower marketing investments</title>
		<link>http://www.sylvainpaillard.com/wordpress/?p=204</link>
		<comments>http://www.sylvainpaillard.com/wordpress/?p=204#comments</comments>
		<pubDate>Wed, 03 Jun 2009 10:54:59 +0000</pubDate>
		<dc:creator>Sylvain Paillard</dc:creator>
		
		<category><![CDATA[artificial intelligence]]></category>

		<category><![CDATA[customer experience]]></category>

		<category><![CDATA[Marketing Investments]]></category>

		<category><![CDATA[Webcom]]></category>

		<guid isPermaLink="false">http://www.sylvainpaillard.com/wordpress/?p=204</guid>
		<description><![CDATA[I spoke as Keynote Speaker at the Webcom event in Montréal the 13th of May 2009.
The main goal of this speech was to explain why technologies driven by Artificial Intelligence are going to be the key success factor to provide a positive customer experience online. The main reason I advanced is that the complexity of [...]]]></description>
			<content:encoded><![CDATA[<p>I spoke as Keynote Speaker at the <a href="http://www.webcom-montreal.com" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.webcom-montreal.com');" target="_blank">Webcom event</a> in Montréal the 13th of May 2009.</p>
<p>The main goal of this speech was to explain why technologies driven by Artificial Intelligence are going to be the key success factor to provide a positive customer experience online. The main reason I advanced is that the complexity of the task of satisfying all the different of the customer requests properly is way too high to be managed with any traditional approach.</p>
<p>You can find here the main content of this speech: the <a href="http://www.sylvainpaillard.com/wordpress/wp-content/uploads/2009/06/sylvain-paillard-webcom-keynote-may09.pdf" onclick="javascript:pageTracker._trackPageview('/downloads/wordpress/wp-content/uploads/2009/06/sylvain-paillard-webcom-keynote-may09.pdf');" target="_blank">Presentation slides</a> and the video of the keynote as well as an short interview I gave the same day (unfortunately, the videos are in French).</p>
<p style="text-align: center;"><a href="http://www.sylvainpaillard.com/wordpress/wp-content/uploads/2009/06/gun.png" ><img class="aligncenter size-full wp-image-214" title="A positive customer experience can be hard to reach..." src="http://www.sylvainpaillard.com/wordpress/wp-content/uploads/2009/06/gun.png" alt="A positive customer experience can be hard to reach..." width="471" height="471" /></a></p>
<p><span id="more-204"></span></p>
<p><strong>Keynote introduction:</strong></p>
<p>The most significant customer experience issues take place during a customer’s interactions with your company’s web site and contact center.  Negative experiences generate frustration and result in higher costs of support, while discouraging customers to adopt your company’s products/services, sometimes definitely.<br />
We will exemplify how most of these issues are connected with maladapted access to the most critical information of your company and the lack of unification between the different communication channels and data sources.<br />
We will present what are the critical aspects of these issues and how they can be solved by implementing an adapted Customer Experience Strategy.</p>
<p>We will explore how:</p>
<ul>
<li> Contextual Guidance solves the most critical search experience challenges</li>
<li>Social Profiling and Personalization shorten and enrich the exploration experience</li>
<li>Seamless intelligent escalations from the web to your contact center -  lower your costs of support and leverage the way customers interact with your company</li>
<li>1:1 Marketing Automation increase your sales and changes the way customers perceive your brand(s) and increase conversion rates significantly.</li>
</ul>
<p><strong>Video of the Keynote speech:</strong></p>
<p><em>[Not yet released, will add it as soon as it is done]</em></p>
<p><strong>Video of the Interview I did after:<br />
</strong></p>
<p><object width="425" height="344" data="http://www.youtube.com/v/Dk9-d4MknW8&amp;hl=en&amp;fs=1" type="application/x-shockwave-flash"><param name="allowFullScreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.youtube.com/v/Dk9-d4MknW8&amp;hl=en&amp;fs=1" /><param name="allowfullscreen" value="true" /></object></p>

]]></content:encoded>
			<wfw:commentRss>http://www.sylvainpaillard.com/wordpress/?feed=rss2&amp;p=204</wfw:commentRss>
		</item>
		<item>
		<title>A new approach for online engagement</title>
		<link>http://www.sylvainpaillard.com/wordpress/?p=192</link>
		<comments>http://www.sylvainpaillard.com/wordpress/?p=192#comments</comments>
		<pubDate>Mon, 06 Apr 2009 13:56:15 +0000</pubDate>
		<dc:creator>Sylvain Paillard</dc:creator>
		
		<category><![CDATA[customer experience]]></category>

		<category><![CDATA[Online Engagement]]></category>

		<guid isPermaLink="false">http://www.sylvainpaillard.com/wordpress/?p=192</guid>
		<description><![CDATA[Most companies experience a large difference between what their web site is and what they believe it should be. This frustration is quite continuous and the positive feeling of making progress with a new version quickly vanishes after its launch.
Web site suppliers understood this problematic quite well, but didn&#8217;t find the solution. Instead, they use [...]]]></description>
			<content:encoded><![CDATA[<p>Most companies experience a large difference between what their web site is and what they believe it should be. This frustration is quite continuous and the positive feeling of making progress with a new version quickly vanishes after its launch.</p>
<p>Web site suppliers understood this problematic quite well, but didn&#8217;t find the solution. Instead, they use it to their benefits, leveraging it on one side to sell a new expensive version and requesting on the other side the necessary validations from the customer before the creation process is over (when there is still some euphoria of what they believe it will do for them), to make sure there are concrete proofs that the customer satisfaction was reached (as waiting until after the launch would be suicide).</p>
<p>In this post, I make the assumption that the problematic relies at the technology level as customers want their web site to be an attractive and efficient online engagement platform, while web site technology can only deliver an access to a set of fixed information segmented into information pages. As a result, the engagement would have to emerge magically out of the intelligence of the content of the pages, which, except in a few very specific situations, will never happen.</p>
<p><strong>What if we could put in place a better technology which would make web sites more dynamic, not in their content, but in their structure?</strong></p>
<p><strong>What if the resulting flow of interaction was adapting automatically and if the content of the web site was the documentation along this flow?</strong></p>
<p><span id="more-192"></span>Before explaining this approach, I would like to address a few key concepts:</p>
<p><strong>Online Engagement<br />
</strong></p>
<p><em>We define the process of online engagement as transforming visitors of the web site into customer prospects, meaning that they terminate their online interactions by manifesting their intent of becoming a customer. This intent can be converted immediately online, in the case of e-commerce, or through the contact with a sales agent. We call the transfer of the request from the online world to the offline world an &#8220;Escalation&#8221;.<br />
</em></p>
<p><strong>Information Access<br />
</strong></p>
<p><em>Accessing the information is a key aspect of engagement, as no visitors want to become a customer if they didn&#8217;t find anything interesting for them. The success of this critical phase of online engagement depends both on the content and on the technology used to provide the access to this content.</em></p>
<p><strong>Online Qualification<br />
</strong></p>
<p><em>We define the process of online qualification as the collection of customer data which are required to finalize the engagement process. There is no limit on to what type of information the qualification can be (e.g.: socio-demographic information, products of interests, etc.). Their gathering represents a key aspect of Online Engagement, as requesting too many data from customers can lower their interest to the point they are not willing to complete it anymore.<br />
</em></p>
<p><strong>Interaction Flow<br />
</strong></p>
<p><em>We define the Interaction Flow as the chain of interactions between the user and the web site. The interaction flow can be pictured as a succession of states the user has visited (traditionally HTML pages) connected by the selected transitions (traditionally URL links).</em></p>
<p><strong>Contextualization<br />
</strong></p>
<p><em>When considering the Context throughout the Interaction Flow of an Information Access process, the context is initially very large (the whole web site) and becomes very small along the way (to a very specific piece of information). We call the process of passing from a very large context to a very small one the Contextualization.</em></p>
<p><strong>The concept of my new approach to online engagement relies on making all the ending points of the Contextualization process to become the points of Escalation (requiring no more, or very limited additional qualification). From this basis, monitoring the success of the web site as an online engagement platform can be simplified to the analysis the conversion rate of each point of escalation, as well as all the ending states of each unfinished Contextualization.</strong></p>
<p>To make this concept possible in the reality, we need a technology which could generate the Contextualization automatically (e.g.: from both the company&#8217;s products meta-data and customer qualification requirements). As the only goal of the web site would become the Contextualization (bringing the user to the most relevant information, while gathering the required information about him), the structure would have to provide the supporting information for all the possible interaction states.</p>
<p>The required information would then be provided not as information pages as it is the case today, but as explanation around the different Contextualization steps.</p>
<p>Let me give an example to illustrate this approach:</p>
<blockquote><p><em>Company A provides 9 products, which can be categorized in 3 domains sales,support,supply; and in 3 functions optimize,manage,analyse (i.e.: one product for each combination).</em></p>
<p><em>Let&#8217;s consider that the price (and difference of features) of the products depends on 2 factors: the customer size: big,medium,small; and the customer industry: bank,telco,IT.</em></p>
<p><em>Let&#8217;s consider that there is 5 information required to qualify a customer: name, position (CEO, CTO, others), industry, domain of interest (sales, support, supply) and e-mail</em></p>
<p><em>If we consider that the name and e-mail are the 2 only elements which should not be considered during the Contextualization, we can see we have 5 possible states for the interaction flow (domain, function, size, industry, position).</em></p>
<p><em>The approach would imply to create the required information to support all the possible Contextualization process cases.<br />
</em></p></blockquote>
<p>As, in our example there is 3 possible values per attribute, we can see that the number of possible ending points of the Contextualization are 3^5 = 243 and the number of possible Contextualization processes (considering the first step dynamically selected as well as all the others) 5*3*4*3*3*3*2*3*1*3, or 5!*3^5 = 120*243 = 29&#8242;160.</p>
<p>We can immediately see that the possible dimensions of both of them are too big to prepare a specific textual information to support each. This challenge can be addressed if the information provided is either not static (e.g.: pricing information and ROI calculation depending of dynamic parameter), or the same in many cases (e.g.: the message to explain the pricing can be the same for a CEO looking to optimize support or the a CTO looking to manage supply, as, in the case explained above, the differentiating price factors are only the customer size and the industry).</p>
<p>However, when looking at the most popular Contextualization cases, the customer might be willing to differentiate the message at every state for his most important cases to receive the most personalized information possible.</p>
<p>In addition to provide a direct connection between the Information Access and the Escalation, this approach has the big advantage to provide a possible context for each piece of information a company wants to put on his web site: a context connected to the intent and the profile of the customer.</p>
<p>It is therefore more likely that the company will naturally provide engaging information which makes sense for the customer and his current context of mind and therefore will result in more engaging information in addition to a more engaging process.</p>

]]></content:encoded>
			<wfw:commentRss>http://www.sylvainpaillard.com/wordpress/?feed=rss2&amp;p=192</wfw:commentRss>
		</item>
	</channel>
</rss>
