<?xml version='1.0' encoding='UTF-8'?><rss xmlns:atom='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' version='2.0'><channel><atom:id>tag:blogger.com,1999:blog-16014280</atom:id><lastBuildDate>Wed, 26 Mar 2008 18:29:25 +0000</lastBuildDate><title>XODP Blog</title><description/><link>http://blog.xodp.org/</link><managingEditor>Internet Esquire</managingEditor><generator>Blogger</generator><openSearch:totalResults>57</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-343631537419134081</guid><pubDate>Wed, 26 Mar 2008 18:19:00 +0000</pubDate><atom:updated>2008-03-26T11:29:10.730-07:00</atom:updated><category domain='http://www.blogger.com/atom/ns#'>Search</category><category domain='http://www.blogger.com/atom/ns#'>Jimbo Wales</category><category domain='http://www.blogger.com/atom/ns#'>Nupedia</category><category domain='http://www.blogger.com/atom/ns#'>Free Content</category><category domain='http://www.blogger.com/atom/ns#'>Wikimedia</category><category domain='http://www.blogger.com/atom/ns#'>Citizendium</category><category domain='http://www.blogger.com/atom/ns#'>Wikia</category><category domain='http://www.blogger.com/atom/ns#'>Free Software</category><category domain='http://www.blogger.com/atom/ns#'>Wikipedia</category><category domain='http://www.blogger.com/atom/ns#'>Google</category><category domain='http://www.blogger.com/atom/ns#'>Web 2.0</category><title>The Ongoing Viability of Wikia and Wikimedia</title><description>For reasons that defy rational explanation, the powers that be at &lt;A href="http://www.wikia.com"&gt;Wikia&lt;/A&gt; dispute the claim that Wikia is the for-profit counterpart of the non-profit &lt;A href="http://www.wikimedia.org/"&gt;Wikimedia Foundation&lt;/A&gt;.  No doubt important distinctions can be made between these two business entities, but the one that stands out to most people is that the for-profit Wikia's primary source of funding is venture capital and banner advertising, whereas the non-profit Wikimedia relies entirely upon charitable donations, and that distinction does not change the fact that both entities sprang from common roots, much like modern primates and humans &lt;A href="http://anthropology.si.edu/HumanOrigins/ha/primate.html"&gt;evolved from a common ancestor&lt;/A&gt;.  In sum, Wikia and Wikimedia are both &lt;A href="http://tech.groups.yahoo.com/group/xodp/message/1792"&gt;free software/free content success stories&lt;/A&gt; that involve many of the same characters and that are still being written.&lt;br /&gt;&lt;br /&gt;In a blog entry that I posted over a year ago, I asserted that &lt;A href="http://blog.xodp.org/2007/02/rumors-of-wikipedias-imminent-demise.html"&gt;rumors of Wikipedia's imminent demise&lt;/A&gt; were greatly exaggerated, &lt;A href="http://www.wikipedia.org/"&gt;Wikipedia&lt;/A&gt; being the original free content/free software project from which the Wikimedia Foundation was born.  Indeed, even if both Wikia and Wikimedia were to go out of business tomorrow for some reason, the software and content that they've generated up until now would persist and remain viable for the foreseeable future in some other form.  Only a far-reaching, cataclysmic event could change that because both the data and software for sites like Wikipedia are free to anyone who wants them, and a truly unique &lt;A href="http://blog.xodp.org/2007/03/upgrading-to-web-30.html"&gt;Web 2.0&lt;/A&gt; informational resource like Wikipedia puts &lt;A href="http://www.google.com/"&gt;Google&lt;/A&gt; and all other search engines like Google to shame in certain ways.  Most notably, Wikipedia does a much better job than Google does when it comes to &lt;A href="http://blog.xodp.org/2006/08/keywords-semantics-and-disambiguation.html"&gt;disambiguation&lt;/A&gt; of keywords.&lt;br /&gt;&lt;br /&gt;The key to Wikipedia's success is the more or less selfless contributors who can and often do come and go at will and may number from the hundreds to the hundreds of thousands, depending on whose being counted and whose doing the counting.  And while Wikia's various communities are diminutive by comparison to Wikipedia, they operate on the same principles of openness and freedom, and they are just as capable of generating quality content on a wide variety of topics.  And while both Wikia and Wikimedia sustain equally viable and "free" communities, they are very different business entities when it comes to their core philosophies in re commercial advertising.  To wit, Wikimedia goes out of its way to shelter its users from &lt;A href="http://en.wikipedia.org/wiki/Wikipedia:Advertisements"&gt;commercial advertising&lt;/A&gt;, whereas Wikia supports its 5,500 or so "free" online communities with above the fold banner advertisements.  Not that there's anything wrong with that.  As &lt;A href="http://www.gnu.org/philosophy/free-sw.html"&gt;the old GNU saying goes&lt;/A&gt;, " [T]hink of free as in free speech not as in free beer."&lt;br /&gt;&lt;br /&gt;As of this writing, Wikipedia dominates Google's search results and Wikia's Alexa Ranking is a respectable 339 and rising, with both funding and actual revenue continuing to increase, thereby giving one every reason to believe that the thousands of online communities associated with Wikia and Wikimedia will continue to grow and thrive.  At the same time, I would be remiss if I did not comment on the inadequacies of &lt;A href=" http://search.wikia.com/"&gt;Wikia Search&lt;/A&gt;, a Wikia product that went live in January of 2008.  Don't get me wrong:  I think it's a great idea to have human editors rank and rate search results.  That's why I started compiling the &lt;A href="http://www.xodp.org/about.html"&gt;XODP Search Results Guides&lt;/A&gt;, to work in conjunction with the brute force of existing search engine algorithms, providing added value after said algorithms had done virtually all of the heavy lifting.  That having been said, there is a general consensus that Wikia Search truly sucks, and I'm quite mystified as to what Jimbo Wales and his cohorts are trying to accomplish.&lt;br /&gt;&lt;br /&gt;Like the late great &lt;A href="http://dmoz.org/"&gt; Open Directory Project&lt;/A&gt; (aka ODP, aka dMOZ), Wikia Search hopes to employ an army of volunteers to do . . . well, after reading through the &lt;A href="http://lists.wikia.com/pipermail/search-l/"&gt;Wikia Search Mailing List Archives&lt;/A&gt;, that's not exactly clear.  There doesn't seem to be any sort of workable theme behind Wikia Search, and the idea of "trusted user feedback" doesn't seem to have any context or relevance to a wiki-based search engine.   What wikis do quite well is allow an exceptionally large group of users to collaborate on content generation, but the only things that this seems to bring to the search engine technology table are: (1) disambiguation of keyword-based search queries; (2) trusted sources of URLs; and (3) the possibility of trustworthy URL meta data.   (In theory, the late great ODP was supposed to provide some trustworthy meta data, but ODP is now a historical object lesson in large scale and recalcitrant denial of &lt;A href="http://www.traffick.com/story/06-2000-xodp.asp"&gt;quality control failure&lt;/A&gt;.)&lt;br /&gt;&lt;br /&gt;The true believers of Wiki Search counter their critics by pointing to the unlikely success of Wikipedia and the aforementioned inaccurate rumors of Wikipedia's imminent demise.  I think a better comparison would be the recalcitrant denial of Larry Sanger and company in re the failure of the &lt;A href="http://nupedia.8media.org/"&gt;Nupedia concept&lt;/A&gt; by creating &lt;A href="http://en.citizendium.org/wiki/Main_Page"&gt;Citizendium&lt;/A&gt;, a failure that has already been answered by the unqualified success of Wikipedia.  To wit, Wikia Search is a vaporware solution in search of a problem that already has an adequate solution.  This sad current state of affairs in re Wikia Search doesn't detract from the accomplishments of Wikia and Wikimedia, which &lt;A href="http://ap.google.com/article/ALeqM5i8x-wlh7nMm12x_kVQ6MZxrYWK9QD8VKMV1G0"&gt;by all accounts&lt;/A&gt; should remain viable entities for the foreseeable future.</description><link>http://blog.xodp.org/2008/03/ongoing-viability-of-wikia-and.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-1991116466517219287</guid><pubDate>Thu, 29 Nov 2007 18:18:00 +0000</pubDate><atom:updated>2007-11-29T10:35:56.889-08:00</atom:updated><category domain='http://www.blogger.com/atom/ns#'>Jimbo Wales</category><category domain='http://www.blogger.com/atom/ns#'>dMoz</category><category domain='http://www.blogger.com/atom/ns#'>ODP</category><category domain='http://www.blogger.com/atom/ns#'>Project Napa</category><title>This Just In:  ODP Still Sucks</title><description>While Googling my byline a month or so ago to keep current on any recent gossip about me, I stumbled upon &lt;A href="http://forums.digitalpoint.com/showthread.php?p=4460117"&gt;a thread at Digital Point&lt;/A&gt; discussing the merits (or lack thereof) of the &lt;A href="http://www.dmozsucks.org/"&gt;Dmoz Sucks&lt;/A&gt; website.  One "Rezo,"  who is apparently (one of?) the proprietor(s) of the &lt;A href="http://www.sevenseek.com/"&gt;SevenSeek&lt;/A&gt; web directory, pointed out &lt;A href="http://dmoz.org/"&gt;Dmoz&lt;/A&gt; (aka ODP) sucks (ambiguity intentional) and recommended browsing the eponymous Dmoz Sucks website; one "nebuchadrezzar" followed up with a glowing reference of both me and XODP, closing with, "You guys should make yourselves known to [David Prenatt], he would probably give you a few pointers."&lt;br /&gt;&lt;br /&gt;Nebuchadrezzar also pointed out the fact that &lt;A href="http://tech.groups.yahoo.com/group/xodp/message/559"&gt;Jimbo Wales frequented XODP back in the day&lt;/A&gt; and opined that XODP may have played a role in helping Jimbo avoid many of the pitfalls of ODP in creating &lt;A href="http://www.wikipedia.org/"&gt;Wikipedia&lt;/A&gt;.  I'd like to think that were true, but (knowing Jimbo) I doubt that he would acknowledge that XODP had any kind of profound impact on him or on Wikipedia.  Even so, Wikipedia is precisely the sort of open content community that I had hoped to inspire and/or create &lt;A href="http://www.traffick.com/story/06-2000-xodp.asp "&gt; when I founded XODP&lt;/A&gt;, and while I have my reservations about Wikipedia, they are minor ones, and I make a point of extolling the virtues of Wikipedia from time to time on XODP and elsewhere.&lt;br /&gt;&lt;br /&gt;I first came across Jimbo Wales during my tenure as Chief Evangelist at the-late-and-never-that-great Wherewithal.  Much to my surprise, Jimbo had a distant history with one of Wherewithal's founders, and there was some talk of licensing Wherewithal's ad serving technology to Jimbo's company &lt;A href="http://www.bomis.com/"&gt;Bomis&lt;/A&gt;.  At the time, my stock as an Internet celebrity was very high because of XODP, which is how I and Wherewithal first came to Jimbo's attention.  At the time, I was also consulting with &lt;A href="http://www.prnewswire.com/cgi-bin/stories.pl?ACCT=104&amp;STORY=/www/story/05-02-2000/0001207194&amp;EDATE="&gt;Project Napa&lt;/A&gt; as their Chief Ontologist and Community Architect, and I also tried to sell them on the idea of serving up ads a la Wherewithal as a way of generating revenue for their open content people portal.  However, both of these promising business leads (and many others) were burned by Wherewithal's founders who simply did not have the wherewithal (pun intended) to put a profitable business deal together.  Consequently, I stopped bringing the leads to Wherewithal and started operating in stealth mode, quietly playing matchmaker with my various business contacts while Wherewithal faded into obscurity.&lt;br /&gt;&lt;br /&gt;The dot-com bubble had already begun to burst when I signed on at Wherewithal and Project Napa, but I felt (and still feel) that there was (and still is) quite a bit of promise in the area of open content generation and indexing.  Even now I think that the ideas underlying Wherewithal and Project Napa were sound, and I've long toyed with the idea of reincarnating both of them in some form.  Instead, I have focused on the needs of my paying clients; occasionally, I have worked on various &lt;A href="http://www.xodp.org/about.html"&gt;XODP Web Guides&lt;/A&gt; and contributed to Wikipedia to satisfy my innate need to index online content.&lt;br /&gt;&lt;br /&gt;In addition to satisfying my innate need to index online content, the XODP Web Guides have allowed me to demonstrate "proof of concept."  And but for the fact that I am making a decent living as &lt;A href="http://www.netesq.com/about.html"&gt;an Internet consultant for attorneys&lt;/A&gt;, work that I truly enjoy, I would embrace the opportunity to make more money by publishing XODP Web Guides while also improving the quality of online content indexing.  Indeed, there may yet come a day when an investor throws so much money at me that I will take the necessary time and effort to hire and train enough staff to publish and maintain thousands (or even tens of thousands) of XODP Web Guides instead of the three hundred or so I currently publish.  Of course, if making money were my primary motivation for indexing online content, I would probably start up a Web Directory like the aforementioned Seven Seek and charge for website submissions.&lt;br /&gt;&lt;br /&gt;While I don't know how Seven Seek is doing financially, charging for website submissions is a proven business model that has worked on a large scale for &lt;A href="https://ecom.yahoo.com/dir/submit/intro/"&gt;Yahoo!&lt;/A&gt; and &lt;A href="http://www.looksmart.com/"&gt;LookSmart&lt;/A&gt; and on a smaller scale for &lt;A href="http://www.goguides.org/"&gt;GoGuides&lt;/A&gt; and &lt;A href="http://botw.org/"&gt;Best of the Web&lt;/A&gt;.  Getting back to the subject implied by the title of this post, if there is a failure inherent in the ODP business model, it is the fact that ODP has no business model.  Rather, when &lt;A href="http://tech.groups.yahoo.com/group/xodp/message/1832"&gt;Netscape acquired ODP for the sum of one million dollars back in 1999&lt;/A&gt;, it purchased the good will of the then nascent open content indexing community.&lt;br /&gt;&lt;br /&gt;When Netscape acquired ODP, it almost seemed to make sense, as Netscape had (a largely unearned) reputation for being a good 'Netizen.  However, when AOL acquired Netscape and gave away ODP's content (albeit with bizarre and probably unenforceable licensing restrictions), it made sense to do so in a different way:  Business sense.  To wit, because AOL exercised editorial control over ODP content, it was able to offer &lt;I&gt;quid pro quo&lt;/I&gt; to professional content providers like AOL and Rolling Stone.  It also took the profit out of LookSmart's bottom line as a content provider, thereby appearing to eliminate a serious AOL competitor from growing and prospering.  However, when LookSmart turned the tables on AOL and offered to share advertising revenue with its publishers, ODP became totally irrelevant and has been ever since.&lt;br /&gt;&lt;br /&gt;My original purpose in founding XODP was to point out the flaws in ODP and foster discussion about the future of the Open Content Movement.  Since that time, Wikipedia has deposed ODP and rightfully claimed the mantle of that movement with a viable business model that relies upon charitable donations.  However, rather than letting ODP die with dignity, AOL keeps ODP on public display in its critical care unit, a sad state of affairs that will probably last at least until the next ODP system crash occurs.  In a perfect world, the powers that be at Wikipedia would acquire ODP and make it a truly open directory, but I wouldn't hold my breath waiting for that to happen.  As for the proprietors of the aforementioned Dmoz Sucks website, I wish them well, and they are more than welcome to join XODP and rant.  However, I really can't be bothered to review their website any deeper than their Home Page and offer a substantive opinion on it:  The fact that ODP sucks ain't news, and it hasn't been for quite some time.</description><link>http://blog.xodp.org/2007/11/this-just-in-odp-still-sucks.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117428431680437403</guid><pubDate>Mon, 19 Mar 2007 08:32:00 +0000</pubDate><atom:updated>2007-03-19T01:34:07.653-07:00</atom:updated><title>Upgrading to Web 3.0</title><description>The term "Web 2.0" was coined sometime in 2004 by Dale Dougherty of O'Reilly Media.  Far from a new technical standard, Web 2.0 began as a marketing gimmick for the eponymous &lt;A href="http://www.web2con.com/"&gt;Web 2.0 Summit&lt;/A&gt;, an annual high tech trade show that will be in its fourth year as of October 2007.  As &lt;A href="http://www.oreillynet.com/pub/a/oreilly/tim/news/2005/09/30/what-is-web-20.html"&gt;Tim O'Reilly wrote in September of 2005&lt;/A&gt;:&lt;BLOCKQUOTE&gt;&lt;I&gt;"The bursting of the dot-com bubble in the fall of 2001 marked a turning point for the web. . . .&lt;BR&gt;&lt;BR&gt;" . . . Far from having 'crashed,' the web was more important than ever, with exciting new applications and sites popping up with surprising regularity. What's more, the companies that had survived the collapse seemed to have some things in common. . . ."&lt;/I&gt;&lt;/BLOCKQUOTE&gt;Far from a universal or even ubiquitous technical meme, Web 2.0 has nonetheless come to mean many things to many people in the high tech industry while meaning absolutely nothing to others.  But this hasn't stopped many people in the know from calling for a Web 3.0 upgrade.  The first person worthy of note to call for such an upgrade was Jeffrey Zeldman, who did so tongue-in-cheek as part of &lt;A href="http://www.alistapart.com/articles/web3point0"&gt;a rant against the Web 2.0 meme&lt;/A&gt;:&lt;blockquote&gt;&lt;I&gt;"When I started designing websites, if the guy on the plane next to me asked what I did, I had to say something like 'digital marketing' if I wanted to avoid the uncomprehending stare.&lt;BR&gt;&lt;BR&gt;"A few years later, if I told the passenger beside me I was a web designer, he or she would regard me with a reverence typically reserved for Stanley-Cup-winning Nobel Laureate rock stars.&lt;BR&gt;&lt;BR&gt;"Then the bubble burst, and the same answer to the same question provoked looks of pity and barely concealed disgust. . . .&lt;BR&gt;&lt;BR&gt;"[ . . . ]&lt;BR&gt;&lt;BR&gt;"Eventually . . . [t]he web was "back" even though it had never left. . . .&lt;BR&gt;&lt;BR&gt;"[ . . . ]&lt;BR&gt;&lt;BR&gt;"But how to persuade the other sharks in the tank that this blood feast was different from the previous boom-and-bust? Easy: Dismiss everything that came before as 'Web 1.0.'"&lt;/I&gt;&lt;/blockquote&gt;As a technical meme, Web 3.0 assumes the validity of a still emerging Web 2.0 meme, and then asks, "Who and what will be left standing when the next economic bust and boom cycle has finished ravaging the Internet?"  I'd like to think that sexy applications like &lt;A href="http://blog.xodp.org/2007/02/powerset-redux.html"&gt;natural language processing&lt;/A&gt;, artificial intelligence, and &lt;A href="http://blog.xodp.org/2007/02/grokking-semantic-web.html"&gt;the Semantic Web&lt;/A&gt; will rise from the ashes.  However, I've been through the cycle of disruptive technologies too many times to expect this sort of purposeful progress.</description><link>http://blog.xodp.org/2007/03/upgrading-to-web-30.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117316370796423888</guid><pubDate>Sat, 10 Mar 2007 23:10:00 +0000</pubDate><atom:updated>2007-03-14T01:21:34.930-07:00</atom:updated><title>The Waning Relevance of Search Engines</title><description>When the Web was young -- i.e., before the advent of Yahoo! and Google -- &lt;A href="http://www.columbia.edu/~rh120/ch106.x10"&gt;USENET&lt;/A&gt;[sic] was the primary medium for fielding informational queries on the Internet, and &lt;A href="http://www.faqs.org/"&gt;USENET FAQs&lt;/A&gt; were the primary medium by which such queries were answered.  Incidental to such FAQs were pointers to particularly useful URLs that people had discovered and vetted by following trails of informational bread crumbs.  Most of these trails started on USENET.  However, over time, more and more Home Pages on the World Wide Web became populated with links to interesting and useful websites, and then along came the spiders.&lt;br /&gt;&lt;br /&gt;Spiders (now more commonly known as web crawlers) created the first indexes[sic] of the World Wide Web by following links from one URL to another and making a record of what they found along the way.  An individual spider or collection of spiders working together can produce a somewhat comprehensive database of URLs on the World Wide Web.  However, that database is (at best) a collection of snapshots from some time in the recent past rather than a live feed that includes recent changes, and it will not include URLs found on "The Invisible Web."  These distinctions were once lost on most people, who assumed that a "'Net search" would magically provide up to the minute information from the Web; this distinction is still lost on some people who have no idea how a search engine works.&lt;br /&gt;&lt;br /&gt;The term &lt;I&gt;search engine&lt;/I&gt; predates the Web.  The earliest reference to the term that I've found was by &lt;A href="http://www.cs.utah.edu/~hollaar/"&gt;Professor Lee A. Hollaar&lt;/A&gt; of Utah University back in March of 1985.  (&lt;I&gt;The Utah Text Search Engine: Implementation Experiences and Future Plans&lt;/I&gt;, Proceedings of the Fourth International Workshop on Database Machines.)  But the term really caught on with the Internet community in the early 1990s.  By that time, spiders were already cataloging an enormous amount of content found on the Visible Web.  Even so, very few people otherwise in the know appreciated just how important search engines would soon become, erroneously assuming that anything more complex than the &lt;A href="http://www.gnu.org/software/grep/"&gt;Unix "grep" query&lt;/A&gt; was overkill.&lt;br /&gt;&lt;br /&gt;A former colleague of mine was a product manager for Netscape before the company was acquired by America Online, and he once recounted to me how popular the search function was on the  Netscape website from the very beginning.  Meanwhile, almost everyone at Netscape was trying to figure out how to sell the company's Web browsers and its &lt;A href="http://en.wikipedia.org/wiki/Netscape#Product_list"&gt;easily forgotten line of related software products&lt;/A&gt;, all of which became completely irrelevant when Microsoft started giving away the Internet Explorer Web browser.  Even Yahoo! failed to recognize the importance of search-related products, quickly diversifying from a searchable Web directory into a Web portal and outsourcing its search services to Google until a few years ago.&lt;br /&gt;&lt;br /&gt;For all its flaws, &lt;A href="http://infolab.stanford.edu/~backrub/google.html"&gt;the Google algorithm&lt;/A&gt; is still the standard by which all other search engine algorithms and post-Google information retrieval mechanisms are judged.  The only post-Google innovation in content indexing and retrieval that even comes to close to being such a standard is Wikipedia, and most of those who consider Wikipedia a successful innovation do so because of Wikipedia's prominence and visibility in Google's search results.  Even &lt;A href="http://tech.groups.yahoo.com/group/xodp/message/2259"&gt;the blogosphere's importance&lt;/A&gt; is largely validated by the impact that it has on Google's search results.  However, the importance of search engines reached its zenith a while back, and their relevance is slowly waning.&lt;br /&gt;&lt;br /&gt;As &lt;A href="http://blog.xodp.org/2007/02/wikipedias-feckless-attempts-to-fight.html"&gt;I reported previously&lt;/A&gt;, Wikipedia recently announced that outbound links from Wikipedia to other websites would include a "nofollow" attribute.  The rationale for this decision is that it will reduce the incentive for "black hat" search engine optimizers (SEOs) to spam Wikipedia, as a link from an important website like Wikipedia will normally inflate the PageRank that Google assigns to that link's destination URL.  However, I think the end result will be something quite different.  To wit, to the extent that Google actually ignores outbound links from Wikipedia, Wikipedia will actually offer something unique and different from Google to its end users.  I don't think this will put Google out of business, but it will eventually diminish the relevance of search engines as the primary medium for fielding online informational queries.&lt;br /&gt;&lt;br /&gt;In a vein similar to Wikipedia, sites like &lt;A href="http://blog.xodp.org/2007/03/digg-effect.html"&gt;Digg&lt;/A&gt;, &lt;A href="http://www.fark.com/"&gt;Fark&lt;/A&gt;, &lt;A href="http://technorati.com/"&gt;Technorati&lt;/A&gt;, &lt;A href="http://reddit.com/"&gt;reddit&lt;/A&gt;, and &lt;A href="http://del.icio.us/"&gt;del.icio.us&lt;/A&gt; are emerging as places for the purveyors of information to hook up with their audiences with less and less concern for the impact that such sites have upon a(n) URL's Google PageRank.  Moreover, like Wikipedia, certain online portals are so prominent that the only reason anyone uses a search engine to find them is because many end users are not in the habit of using bookmarks to pull them up -- e.g., &lt;A href="http://www.ebay.com/"&gt;eBay&lt;/A&gt; and &lt;A href="http://blog.xodp.org/2006/08/advent-of-myspace.html"&gt;MySpace&lt;/A&gt;.  Even people who choose to use a search engine to find websites with the products, services, or information that they want sometimes complain that &lt;A href="http://burningbird.net/technology/wikipedia-walking"&gt;highly commercialized search results require a serious researcher to dig to the second or third page of search results to find something useful.&lt;/A&gt;  In sum, while far from obsolete, the relevance of search engines is slowly waning, and commercial interests are slowly turning their attention to other channels of online marketing.</description><link>http://blog.xodp.org/2007/03/waning-relevance-of-search-engines.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117339765928438720</guid><pubDate>Fri, 09 Mar 2007 22:29:00 +0000</pubDate><atom:updated>2007-03-19T10:09:16.406-07:00</atom:updated><title>Jimbo Wales Feeds Wikipedians and the Press a Red Herring</title><description>According to &lt;a href="http://news.yahoo.com/s/ap/20070307/ap_on_hi_te/wikipedia_credentials"&gt;Brian Bergstein of the Associated Press&lt;/a&gt;:&lt;blockquote&gt;&lt;I&gt;"Following revelations that a high-ranking member of Wikipedia's bureaucracy used his cloak of anonymity to lie about being a professor of religion, the free Internet encyclopedia plans to ask contributors who claim such credentials to identify themselves."&lt;/I&gt;&lt;/blockquote&gt;Discussions regarding this proposal are still underway at Wikipedia, but when &lt;A href="http://en.wikipedia.org/w/index.php?title=User_talk%3AJimbo_Wales%2FCredential_Verification&amp;diff=113617898&amp;oldid=113617175"&gt;news of the above-quoted article reached Jimbo Wales Wikipedia User Page space&lt;/A&gt;, it became quite clear to me that Jimbo had already made up his mind about moving forward with this plan, and that the discussions he had initiated on his user page were simply a way of placating the opposition by giving them an opportunity to speak their mind.&lt;br /&gt;&lt;br /&gt;As someone who has repeatedly &lt;A href="http://en.wikipedia.org/w/index.php?title=User_talk%3AJimbo_Wales%2FCredential_Verification&amp;diff=113617898&amp;oldid=113617175"&gt;asserted Jimbo's lack of culpability&lt;/A&gt; in the recent Essjay scandal, I am very disappointed to see that Jimbo has offered up the red herring of verified credentials rather than address his [Jimbo's] failure to properly vet a pseudonymous individual whom he appointed to ArbCom, Wikipedia's court of final appeal.  Superficially, the Essjay case was about falsified credentials; at its core, it was about &lt;A href="http://blog.xodp.org/2007/02/respected-wikipedian-lies-to-press.html"&gt;an elaborate deception that Essjay rationalized&lt;/A&gt; as being necessary because he held positions of trust at Wikipedia.&lt;br /&gt;&lt;br /&gt;Verifying credentials will not address the core issue of deception by Wikipedia administrators. All it will do is validate a Larry Sanger-esque type of credentialism at Wikipedia and create an attractive nuisance for &lt;A href="http://blog.xodp.org/2007/03/credentialists-and-impostors.html"&gt;credentialists and impostors&lt;/A&gt;. Essjay outed himself as an impostor because he felt that he had to so when accepting his paid position at Wikia. As such, the best remedy for the sort of deception that Essjay perpetrated would be a policy that forced people seeking administrator privileges at Wikipedia to provide the powers that be at Wikimedia with their true identity; if Wikipedia administrators wish to remain pseudonymous to the rest of the world, that's their prerogative.</description><link>http://blog.xodp.org/2007/03/jimbo-wales-feeds-wikipedians-and.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117346067006038469</guid><pubDate>Fri, 09 Mar 2007 17:20:00 +0000</pubDate><atom:updated>2007-03-09T09:23:48.506-08:00</atom:updated><title>The Digg Effect</title><description>&lt;A href="http://www.digg.com/"&gt;The Digg website&lt;/A&gt; was launched just over two years ago on December 5th, 2004, and now boasts having some one million users.  For those of you who have never heard of Digg, it's a community-based website where members can submit news stories for the community's consideration, vote for the stories that they consider newsworthy, and "bury" the stories that they don't like.  In close conjunction with the advent of the blogosphere, Digg has emerged as an alternative news source where stories that would otherwise remain obscure become headline news.&lt;br /&gt;&lt;br /&gt;While following the trail of bread crumbs that is the blogosphere, I started noticing that many of the stories I was following were becoming very prominent on Digg.  I also noticed that many prominent bloggers attributed their blogging success to "being Dugg," and I immediately thought of how quickly a story could rise to prominence on the World Wide Web by being "Slashdotted."  The two concepts share much in common, but the initial seed for a Slashdot story must be planted by the powers that be at Slashdot, and there is no way to formally vote on a Slashdot story to make it more popular.&lt;br /&gt;&lt;br /&gt;After voting on a number of stories on Digg, I finally decided to post &lt;A href="http://digg.com/political_opinion/Uncovering_Pakistan_s_Involvment_in_9_11"&gt;my own&lt;/A&gt;.  As of this writing, my story has 16 Diggs and is ranked third in popularity out of some 260 "up and coming" stories under the category political opinion, and (based on my observations of the Digg system) it still has over 3 hours to become the most popular up and coming story.  Now the question is whether the powers that be at Digg will decide to "make it popular."  If they do, then my ISP will have to make sure that I do not become a victim of &lt;A href="http://blogs.tech-recipes.com/davak/2005/11/06/digg-effect-the-top-10-things-webmasters-should-know/"&gt;the Digg effect&lt;/A&gt;.</description><link>http://blog.xodp.org/2007/03/digg-effect.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117328529936718185</guid><pubDate>Wed, 07 Mar 2007 18:31:00 +0000</pubDate><atom:updated>2007-03-14T04:49:47.290-07:00</atom:updated><title>Putting Essjay's Fraud into Perspective</title><description>I have done my level best to avoid posting yet another entry on the XODP Blog about the Wikipedian impostor known as Essjay, limiting myself to commentary on other blogs and on Wikipedia itself.  However, I was very impressed with &lt;a href="http://world-history-blog.blogspot.com/2007/03/fake-professor-at-wikipedia.html"&gt;a post on Miland Brown's World History Blog&lt;/a&gt; that seemed to put the larger issue of Wikipedia's reliability into perspective:&lt;blockquote&gt;&lt;I&gt;". . . [T]his kind of deception is not new and is not limited to Wikipedia. Does anyone remember Jayson Blair? He both plagiarized and fabricated articles at the New York Times for several years. I do not think Blair has proven that the New York Times is a bad resource though despite his fraud.&lt;BR&gt;&lt;BR&gt;"And let's not forget about Stephen Glass at the New Republic. 27 of 41 stories written by Glass for the magazine contained fabricated material. He wrote such fake gems as a 15 year old at national hacker convention and a Church of George Herbert Walker Bush, Jesus Christ. I still like the New Republic as a resource despite the Glass incident.&lt;BR&gt;&lt;BR&gt;"People fabricating degrees is not new either. The Chronicle of Higher Education frequently exposes people in higher education with diploma mill degrees. For example, history adjunct Fred Ruhlman at the University of Tennessee was reported at Cliopatria, '(His) 'doctorate' is from 'the American University of London,' a notorious diploma mill and his book, Captain Henry Wirz and Andersonville Prison, was withdrawn from publication by the University of Tennessee Press because of its plagiarism from William Marvel's Andersonville: The Last Depot.' While embarrassing for the University of Tennessee, it hardly means UT degrees are now worthless.&lt;BR&gt;&lt;BR&gt;"I can find those fake Blair articles in my library right now in the microfilm versions of the New York Times. Those fake Glass articles from the New Republic are still in the bound periodical section of the library too. However, every edit Essjay has made is being examined and altered if it is found to be problematic. Unlike the mainstream press who have their mistakes archived forever in libraries, Wikipedia can be fixed when the fraud is discovered."&lt;/I&gt;&lt;/blockquote&gt;In &lt;A href="http://blog.xodp.org/2007/03/closure-still-lacking-in-essjay.html"&gt;a previous blog post&lt;/A&gt;, I qualified my hope that I would like to see Essjay make a fresh start at Wikipedia under a new anonymous pseudonym with the disclaimer that such a fresh start was not possible unless Essjay recants his assertion that he was offered compensation by Stacy Schiff of &lt;I&gt;The New Yorker&lt;/I&gt;.  However, I am inclined to reconsider that disclaimer in light of &lt;A href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2007-03-05/Essjay"&gt;new facts brought to light by the Wikipedia Signpost&lt;/A&gt;:&lt;blockquote&gt;&lt;I&gt;"Based on earlier statements he made, it is possible that [Essjay] used calling cards to phone Schiff for the interview, and his claim may have referred to an offer to reimburse the cost of those calls. . . .  Schiff's response [i.e., "That is nonsense."] did not address whether this alternate interpretation was correct."&lt;/I&gt;&lt;/blockquote&gt;As a (hopefully) final note, I encountered &lt;A href="http://www.freedom-to-tinker.com/?p=1129#comment-298546"&gt;a comment on one blog&lt;/A&gt; that explained to me why some people seem to be judging Wikipedia's credibility so harshly in light of Essjay's fraud:&lt;blockquote&gt;&lt;I&gt;". . . [T]rusting Wikipedia is . . .  a greater &lt;B&gt;personal&lt;/B&gt; risk than trusting the New Yorker, &lt;B&gt;regardless of whether the New Yorker is actually more trustworthy than Wikipedia&lt;/B&gt;. . .  [T]he New Yorker’s puffery about its fact-checking effectively indemnifies [someone] against the risk of looking foolish, in a way that Wikipedia does not. . . .&lt;BR&gt;&lt;BR&gt;"This sort of indemnification is precisely what professional journalists are defending when they rudely dismiss blogs–or Wikipedia–as inferior. Rather than make serious, careful comparisons between themselves and alternative information sources–comparisons that would inevitably show themselves to be seriously flawed in their own right–they emphasize the process by which they gather and vet their content."&lt;/I&gt;&lt;/blockquote&gt;</description><link>http://blog.xodp.org/2007/03/putting-essjays-fraud-into-perspective.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117313314859344421</guid><pubDate>Mon, 05 Mar 2007 23:33:00 +0000</pubDate><atom:updated>2007-04-12T09:57:54.090-07:00</atom:updated><title>Cooperative Research and Citizen Journalism</title><description>Viewer-supported &lt;a href="http://www.linktv.org/"&gt;Link TV&lt;/a&gt;, recently broadcast a program entitled &lt;A href="http://www.911pressfortruth.com/"&gt;9/11 Press For Truth&lt;/A&gt; as part one of a two part special entitled &lt;I&gt;Truth, Lies, and the Press&lt;/I&gt;.  This program was based on the work of Internet researcher Paul Thompson, author of &lt;I&gt;The Terror Timeline&lt;/I&gt; and the website that inspired the book &lt;I&gt;The Complete 9/11 Timeline&lt;/I&gt;, which is now part of the &lt;A href="http://www.cooperativeresearch.org/"&gt;Cooperative Research&lt;/A&gt; website.  I was very impressed with the content of the Link TV special, and I hope to discuss that at some point on my &lt;A href="http://blog.netesq.com/"&gt;Internet Esquire Blog&lt;/A&gt;; what I hope to cover here on the XODP Blog is the Cooperative Research website, also known as the Center for Cooperative Research.&lt;br /&gt;&lt;br /&gt;The Center for Cooperative Research is a non-profit enterprise sponsored by &lt;A href="http://www.globalvision.org/fiscal/mstatement.html"&gt;The Global Center&lt;/A&gt;, a 501(c)3 non-profit corporation, and its content is openly licensed under the &lt;A href="http://creativecommons.org/licenses/by-nc-sa/1.0/"&gt;Creative Commons Attribution-NonCommercial-ShareAlike&lt;/A&gt; license.  As open content licenses go, the Creative Commons is not one of my favorites.  However, the larger mission of the Center for Cooperative Research is one that I consider worthwhile.  To wit:&lt;blockquote&gt;&lt;I&gt;"The website is an experiment in open-content civic journalism. It allows people to investigate important issues by providing a space where people can collaborate on the documentation of past and current events, as well as the entities associated with those events. The website can be used to investigate topics at the local, regional, or global level. The data is displayed on the website in the form of dynamic timelines and entity profiles, and is exportable into XML so it can be shared with others for non-commercial purposes."&lt;/I&gt;&lt;/blockquote&gt;The site's growth potential is somewhat limited because it relies upon qualified professional volunteers to act as editors for the content submitted by member contributors.  However, if the Complete 9/11 Timeline is any indication, what the site lacks in quantity of research, it more than makes up for in terms of quality.  To wit, I was quite astonished to see just how many news items that should have been headlines were reported by the mainstream press but buried on page 11.  Once these news items are put into a timeline format, they make a compelling case for concluding that &lt;A href="http://en.wikipedia.org/wiki/Ahmed_Omar_Saeed_Sheikh#Alleged_connection_to_9.2F11"&gt;Pakistan's ISI Intelligence Service was a sponsor of Al Qaeda and the 9/11 hijackers&lt;/A&gt;.</description><link>http://blog.xodp.org/2007/03/cooperative-research-and-citizen.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117299086129945170</guid><pubDate>Sun, 04 Mar 2007 07:40:00 +0000</pubDate><atom:updated>2007-03-05T05:47:06.273-08:00</atom:updated><title>Closure Still Lacking in Essjay Scandal</title><description>The beleagured Essjay "retired" from Wikipedia today, and I was prepared to let sleeping dogs lie until I read &lt;A href="http://www.andrewlih.com/blog/2007/03/03/essjays-third-transgression/"&gt;a post at Andrew Lih's blog&lt;/A&gt;:&lt;blockquote&gt;&lt;I&gt;"Paying a source for a story is an absolute no-no in the normal practice of print journalism. And it struck me immediately how incredible it was [Essjay] would accuse Stacy Schiff, a Pulitzer Prize winning author writing for The New Yorker, of this crime. We either have a serious breach of ethics with Ms. Schiff or another dubious statement claim from Essjay (nee Ryan Jordan)."&lt;/I&gt;&lt;/blockquote&gt;Notwithstanding the hyperbole of comparing a breach of professional ethics with a crime, a survey of Google search results for &lt;I&gt;''paying a source' journalism'&lt;/I&gt; indicates that paying a source for a story is, indeed, an &lt;A href="http://findarticles.com/p/articles/mi_qa3613/is_199911/ai_n8869912"&gt;ethically murky area&lt;/A&gt;.  As such, there can little doubt that Essjay was lying, once again, adding a truly bizarre spin to an already bizarre story.&lt;br /&gt;  &lt;br /&gt;In &lt;A href="http://blog.xodp.org/2007/03/credentialists-and-impostors.html"&gt;a previous blog post&lt;/A&gt;, I stated that I would like to see Essjay make a fresh start at Wikipedia under the protection of a new, anonymous pseudonym.  However, that's not possible unless Essjay retracts this rather bizarre claim about Stacy Schiff.  And given the fact that a substantial number of Wikipedians remain stuck in denial and steadfast in their support of Essjay, it is highly unlikely that he will ever offer a retraction or an apology.</description><link>http://blog.xodp.org/2007/03/closure-still-lacking-in-essjay.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117288140379002956</guid><pubDate>Sat, 03 Mar 2007 19:57:00 +0000</pubDate><atom:updated>2007-03-03T16:28:21.300-08:00</atom:updated><title>Credentialists and Impostors</title><description>The recent intrigue over Wikipedian Essjay's phony credentials has inspired Larry Sanger to &lt;a href="http://blog.citizendium.org/2007/03/02/our-registration-policy-how-we-check-identities/"&gt;reconsider Citizendium's registration policy&lt;/A&gt;:&lt;blockquote&gt;&lt;I&gt;". . . . We are very concerned about the &lt;/I&gt;credibility&lt;I&gt; of the &lt;/I&gt;Citizendium&lt;I&gt; as a reference work. . . . We simply do not want to wake up in five years, to find that someone has done a study of the Citizendium and demonstrated that in fact 25% of all of our contributors are using neither their real names nor pre-approved pseudonyms.  In short, we've reluctantly concluded that the honor principle, even coupled with a willingness instantly to ban people like Essjay who are exposed for using false personas, really isn't due diligence."&lt;/I&gt;&lt;/blockquote&gt;Larry goes on to narrate the various methods that &lt;I&gt;Citizendium&lt;/I&gt; hopes to use to validate the identities of people who wish to contribute to the project.  To the unitiated observer, strong security measures like this probably seem to make sense.  However, my experience has been that such security measures serve as an attractive nuisance for both impostors and credentialists, and I'm not sure which is worse.&lt;br /&gt;&lt;br /&gt;Conventional wisdom would have us believe that credentials are good and that impostors (in this context, someone who pretends to have credentials that he or she does not actually possess) are bad.  However, the underlying issue is expertise, and credentials don't always translate to expertise.  As such, when someone without expertise uses credentials to win a debate, this is arguably much worse than someone with expertise pretending to have credentials.  The problem with both of these situations is that credentials, or lack thereof, become more important than expertise.&lt;br /&gt;&lt;br /&gt;While I'm a big fan of expertise, I'm not a big fan of credentials, and I'm even less fond of credentialists, as my experience has been that someone who truly has expertise will seldom feel the need to fall back on his or her credentials and will generally feel contempt for those who make a habit of doing so.  Indeed, those with true expertise are usually most conspicuous to me through their modesty and silence, and what concerns me most about Essjay's misguided actions on and off Wikipedia is that they have provided an opportunity for credentialists without expertise to stifle people without credentials who actually have expertise.&lt;br /&gt;&lt;br /&gt;Prior to &lt;A href="http://en.wikipedia.org/w/index.php?title=User_talk:Jimbo_Wales&amp;diff=prev&amp;oldid=112270687"&gt;Jimbo Wales withdrawing his support for Essjay and asking him to resign from his positions of trust at Wikipedia&lt;/A&gt;, it was my sincere hope that Essjay would see the light and do this on his own.  As it is now stands, Jimbo's statement has only partially resolved the situation.  To wit, there are a number of Wikipedians who are urging Essjay's critics to back off, just as there are quite a few people who think that Essjay should still retain his privileges as a Wikipedia administrator.  Personally, I would like to see Essjay make a fresh start at Wikipedia, and the only way that he can do that is if he resigns all privileges that he acquired under his false persona and creates a new account at Wikipedia under the protection of a new, anonymous pseudonym.&lt;br /&gt;&lt;br /&gt;Correction:  In the comments section of the above-cited Citizendium Blog post, Larry Sanger has pointed out my erroneous assertion that &lt;I&gt;Citizendium's&lt;/I&gt; recent policy change was (1) inspired by the intrigue at Wikipedia and (2) made by him and him alone.  I stand corrected. -- DFP,Jr.</description><link>http://blog.xodp.org/2007/03/credentialists-and-impostors.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117278285203567540</guid><pubDate>Thu, 01 Mar 2007 22:22:00 +0000</pubDate><atom:updated>2007-03-01T21:11:04.746-08:00</atom:updated><title>Never Underestimate the Power of Denial</title><description>In following up on &lt;A href="http://blog.xodp.org/2007/02/respected-wikipedian-lies-to-press.html"&gt;my previous XODP Blog post&lt;/A&gt; about the fraud perpetrated by the Wikipedian known pseudonymously as "Essjay," I came upon a post at Kelly Martin's Nonbovine Ruminations blog entitled, &lt;a href="http://nonbovine-ruminations.blogspot.com/2007/03/larry-sanger-proven-right-about.html"&gt;Larry Sanger proven right about Wikipedia&lt;/a&gt;.  I followed a link to the post from Google Blogsearch expecting to find a cheap shot leveled against Wikipedia by a Langer Sanger-esque Citizendium-loving credentialist, but that's not what I found:&lt;blockquote&gt;&lt;I&gt;"Larry Sanger has long argued that Wikipedia is anti-intellectual. And, sadly, he has been proven right, by Jimbo's recent fiat appointment of Essjay to the English Wikipedia's Arbitration Committee. . . ."&lt;/I&gt;&lt;/blockquote&gt;While I agree with most of what Kelly has to say about Essjay, I don't agree with her characterization of Jimbo's misguided loyalty to Essjay as being proof of Wikipedia's alleged anti-intellectual bias.  Rather, I see it as being motivated by recalcitrant denial on the part of Jimbo in re Essjay's recalcitrant denial of his [Essjay's] culpability in perpetrating such a fraud.  On this note, an update at the end of Kelly's blog post points to a recent "apology" on Essjay's user talk page at enwiki:&lt;blockquote&gt;&lt;I&gt;"I would like to clear up an oversight on my part. I was, until this morning, under the impression that in my initial post on this subject (in response to a question from Dev920 made some weeks ago) I had made an apology for anyone who felt they were hurt by my decision to use misinformation. In speaking to various different people, including Jimbo, I did make it known that I was sorry that anyone felt hurt by my actions, and I believed I had done so in my initial statement. On re-reading that, I find I did not; it was a rather lengthy statement I had been thinking about for some time, and I seem to have left out a rather critical element of it. So, I rectify that now, with further apologies that it was not included originally, as I pointed people back to that statement in the belief it was complete.&lt;br /&gt;&lt;br /&gt;"I *am* sorry if anyone in the Wikipedia community has been hurt by my decision to use disinformation to protect myself. I'm not sorry that I protected myself; I believed, and continue to believe, that I was right to protect myself, in light of the problems encountered on the internet in these trying times. I have spoken to all of my close friends here about this, and have heard resoundingly that they understand my position, and they support me. . . . I'm also sorry the New Yorker chose to print what they did about me; there seems to be a belief that I knew they were going to print it, and that is not the case. . . .&lt;br /&gt;&lt;br /&gt;"For two years, I have poured my life into making this site a better place. That many people feel hurt by my decision pains me greatly, and to them I am genuinely sorry. To the stalkers, the trolls, and the vandals, I am not sorry; they are abusive, hateful people, and they have done far worse things than those whole of the Wikipeida Community, myself included, have ever thought about doing. . . . I have no intention of going anywhere, because to do so would be to let the vandals, trolls, and stalkers win."&lt;/I&gt;&lt;/blockquote&gt;I wouldn't go so far as to say that Essjay is paranoid and delusional, but this "apology" makes it quite clear that Essjay is unapologetic about perpetrating this fraud.  He even goes so far as to blame two of his victims (i.e., Stacy Schiff and her fact checker from &lt;I&gt;The New Yorker&lt;/I&gt;) for not being more skeptical about his credentials, credentials to which he frequently averred when attempting to boost his credibility as an expert in theology.  Essjay's earlier statements on this matter left some room for reasonable doubt as to whether he was actually bragging about being able to perpetrate this fraud, but what he just doesn't get is that his earlier excuse that such a fraud was necessary, justifiable and/or excusable as a way of protecting himself from online stalkers just does not hold water.</description><link>http://blog.xodp.org/2007/03/never-underestimate-power-of-denial.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117271610542934104</guid><pubDate>Thu, 01 Mar 2007 05:31:00 +0000</pubDate><atom:updated>2007-03-01T23:27:02.436-08:00</atom:updated><title>Respected Wikipedian Lies to the Press</title><description>Stacy Schiff of &lt;I&gt;The New Yorker&lt;/I&gt; wrote &lt;a href="http://www.newyorker.com/fact/content/articles/060731fa_fact"&gt;an article about Wikipedia that was first published online in July of 2006&lt;/A&gt;.  One of Schiff's sources was a pseudonymous Wikipedian known as "Essjay" who lied to Schiff about virtually all of his biographical details:&lt;blockquote&gt;&lt;I&gt;" .  . . [Essjay] was described in the piece as 'a tenured professor of religion at a private university' with 'a Ph.D. in theology and a degree in canon law.'&lt;BR&gt;&lt;BR&gt;"Essjay was recommended to Ms. Schiff as a source by a member of Wikipedia's management team because of his respected position within the Wikipedia community. He was willing to describe his work as a Wikipedia administrator but would not identify himself other than by confirming the biographical details that appeared on his user page. . . .  Essjay now says that his real name is Ryan Jordan, that he is twenty-four and holds no advanced degrees, and that he has never taught. He was recently hired by Wikia—a for-profit company affiliated with Wikipedia—as a 'community manager'; he continues to hold his Wikipedia positions. . . ."&lt;/I&gt;&lt;/blockquote&gt;While &lt;A href="http://www.wikipedia-watch.org/essjay.html"&gt;Daniel Brandt first broke this story quite some time ago&lt;/A&gt;, it's just now starting to get some traction in the blogosphere, and it took me quite a while to uncover the details.  A good place to bring yourself up to speed is on &lt;A href="http://en.wikipedia.org/wiki/User_talk:Jimbo_Wales#The_New_Yorker_quotes_you"&gt;Jimbo Wales Wikipedia User Page&lt;/A&gt;.  However, the bottom line is that Essjay acknowledges no wrongdoing, claiming to be justified in perpetrating this fraud as a way of protecting himself from online stalkers.&lt;br /&gt;&lt;br /&gt;While it's not a crime of moral turpitude, lying to a member of the press when the truth will do just as well is evidence of moral turpitude, not to mention just plain stupid when you lie about falsifiable facts, and Essjay's excuse of necessity just doesn't wash.  To wit, in addition to lying to the press, Essjay used his counterfeit credentials to bolster his and Wikipedia's credibility in &lt;A href="http://en.wikipedia.org/wiki/User:Essjay/Letter"&gt;a letter that he gave to students&lt;/A&gt; who wanted to cite Wikipedia as a reference work.   He didn't have to give the interview, he didn't need to write the letter, and he didn't need to lie to the press to protect himself from online stalkers.&lt;br /&gt;&lt;br /&gt;I am no stranger to online stalkers.  Shortly after I founded the XODP Yahoo! eGroup, a (now former) ODP meta editor set up &lt;A href="http://web.archive.org/web/20000816190529/http://www.netesqsucks.com/"&gt;Netesqsucks.com&lt;/A&gt;, a hate site dedicated to persecuting me.  And while said meta editor set up said hate site under an assumed name and a fake address, an investigative reporter uncovered his true identity by tracking down the credit card that said stalker used to pay for the site's domain registration.  Eventually, this meta editor (a private investigator) was &lt;A href="http://tech.groups.yahoo.com/group/xodp/message/760"&gt;terminated for using sock puppets&lt;/A&gt; to promote his own websites on ODP and exclude his competitors.&lt;br /&gt;&lt;br /&gt;I didn't need to talk to the investigative reporter who uncovered the truth about the person behind the Netesqsucks.com website, but I did, and I took a huge risk in revealing to her in confidence all sorts of personal information about myself on deep background.  Because Essjay was referred to &lt;I&gt;The New Yorker&lt;/I&gt; by Wikipedia management, he was assumed to be trustworthy without providing that background information.  Simply put, Essjay violated that trust.</description><link>http://blog.xodp.org/2007/02/respected-wikipedian-lies-to-press.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117261341835335434</guid><pubDate>Tue, 27 Feb 2007 22:46:00 +0000</pubDate><atom:updated>2007-02-28T21:44:40.546-08:00</atom:updated><title>Vanity. . . Definitely My Favorite Sin</title><description>While there's still &lt;A href="http://blog.citizendium.org/2007/02/25/wikipedia-article-defames-golf-pro-who-sues-company-with-ip-address-of-anonymous-contributor/"&gt;no trackback on the Citizendium Blog&lt;/A&gt; for my previous XODP Blog post entitled &lt;A href="http://blog.xodp.org/2007/02/larry-sanger-gets-it-totally-wrong.html"&gt;Larry Sanger Gets It Totally Wrong&lt;/A&gt;, Larry has condescended to comment on my more recent post entitled &lt;a href="http://blog.xodp.org/2007/02/dealing-with-jackasses-at-wikipedia.html#117261191730441944"&gt;Dealing with Jackasses at Wikipedia and Citizendium&lt;/a&gt;:&lt;blockquote&gt;&lt;I&gt;"What argument or evidence can you offer to support this silly claim that I wish to establish 'centralized content control'?"&lt;/I&gt;&lt;/blockquote&gt;That's a good question.  But I have a much better question:  If it's such a silly claim, why are you bothering to respond to it?  Seriously.&lt;br /&gt;&lt;br /&gt;Larry continues:&lt;blockquote&gt;&lt;I&gt;"I'm surprised that you have jumped on the bandwagon of those who say, 'Oh my gosh, if there are experts involved, it must be Nupedia all over again!'"&lt;/I&gt;&lt;/blockquote&gt;Hardly a bandwagon, although I would have to agree that many other people worthy of note believe that Citizendium qualifies as a recapitulation of Nupedia.  However, I have no problem with having experts involved in wiki projects, and &lt;A href="http://blog.xodp.org/2007/01/wikipedias-bias-against-experts.html"&gt;my position on Wikipedia's unfair bias against experts&lt;/A&gt; would have been quite apparent to you if you'd carefully read the post to which you responded.&lt;br /&gt;&lt;br /&gt;Larry continues:&lt;blockquote&gt;&lt;I&gt;"The Citizendium has editors who can weigh in as "resident experts" as necessary. . . . To suggest . . . that they simply insist on their views without argument--is in essence to malign a whole bunch of people you don't know at all."&lt;/I&gt;&lt;/blockquote&gt;I can't tell you what a joy it is to have you put words in my mouth that I never said.  To wit, I did not say that "Citizendium editors simply insist on their views without argument," and suggesting that this was my position is what logicians refer to as a fallacious "straw man" argument.  For future reference, if I mean to say that Citizendium editors simply insist on their view without argument, I won't have to suggest it; I'll just say it.&lt;br /&gt;&lt;br /&gt;Larry continues:&lt;blockquote&gt;&lt;I&gt;"[C]redentials are necessary for being an editor. Not for being an author, of course, and most of our registered contributors are authors. But, yes, you have to prove that you're actually an expert."&lt;/I&gt;&lt;/blockquote&gt;The need to have a distinction between expert editors and other contributors is somewhat lost on me, but you have answered your own question regarding my characterization of your position on centralized content control.&lt;br /&gt;&lt;br /&gt;Larry continues:&lt;blockquote&gt;&lt;I&gt;"Frankly, we put out on their ear far faster than Wikipedia ever did anyone who actually acts like a jackass. Your own intemperate post, for instance, is the sort of mean-spirited, vicious personal attack that would get you excluded."&lt;/I&gt;&lt;/blockquote&gt;Hardly an intemperate post by me, but definitely an intemperate response on your part wherein you stop holding court just long enough to step outside and challenge me to to mutual combat.</description><link>http://blog.xodp.org/2007/02/vanity-definitely-my-favorite-sin.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117251297750294008</guid><pubDate>Tue, 27 Feb 2007 20:54:00 +0000</pubDate><atom:updated>2007-03-07T10:40:51.586-08:00</atom:updated><title>Dealing with Jackasses at Wikipedia and Citizendium</title><description>Following &lt;A href="http://blog.xodp.org/2007/02/larry-sanger-gets-it-totally-wrong.html#117247433822546200"&gt;a comment by one Mike&lt;/A&gt; on the XODP Blog eventually took me to said Mike's Modern Dragons Blog, which has &lt;a href="http://moderndragons.blogspot.com/2006/10/citizendium.html"&gt;a very lengthy post about Citizendium&lt;/a&gt;.  And while &lt;A href="http://moderndragons.blogspot.com/2006/10/citizendium.html#comment-3691939410543102646"&gt;Larry Sanger seems to be quite enamored with Mike's post&lt;/A&gt;, I think Mike is making extraordinary demands on his potential blogging audience by using over 5,000 words, not including the footnotes, appendices, and comments that follow the main post.  Even so, as Mike's post seems to have drawn the attention of the usual suspects, I made a point of reading through the whole thing, and I was rewarded by the following quotable quote cited by Mike:&lt;blockquote&gt;&lt;I&gt;"Many experts who have left, or otherwise have expressed dissatisfaction with Wikipedia, fall into two categories: Those who have had repeated bad experiences dealing with jackassses, and are frustrated by Wikipedia’s inability to restrain said jackasses; and those who themselves are jackasses."&lt;/I&gt;&lt;/blockquote&gt;I have stated before that &lt;A href="http://blog.xodp.org/2007/01/wikipedias-bias-against-experts.html"&gt;Wikipedia has an unfair bias against experts&lt;/A&gt;, just as I have stated before that project forking at Wikipedia is long overdue.  However, as alluded to by the quote above, experts who have left Wikipedia and jackasses who have left Wikipedia are not mutually exclusive categories.  Rather, the jackasses who make contributing to Wikipedia an unpleasant experience form a spectrum that runs from the most ignorant and uninformed contributors to the most educated and biased control freaks.  I say this as someone who voluntarily limits the vast majority of his Wikipedia contributions to "Talk Pages" rather than getting into "&lt;A href="http://en.wikipedia.org/wiki/Wikipedia:Edit_war"&gt;Edit Wars&lt;/A&gt;" with said jackasses.  Even then, I often find myself blown away by the recalcitrance of the jackasses for which Wikipedia seems to be an attractive nuisance.&lt;br /&gt;&lt;br /&gt;On this note, from the comments section of Mike's blog post comes the following gem from Wikipedian Fred Bauder:&lt;blockquote&gt;&lt;I&gt;"Most Wikipedians support Citizendium, but, having had experience with Larry, most old-timers are somewhat sceptical[sic] about a project controlled by him. Given a choice, he will nearly always choose a solution which involves top-down control. . . . Another problem, he does not so much respect expertise, as credentials. . . ."&lt;/I&gt;&lt;/blockquote&gt;Having had &lt;A href="http://en.wikipedia.org/w/index.php?title=Talk:Law/Old_2&amp;oldid=216963"&gt;direct contact with Fred Bauder during my early editing experiences at Wikipedia&lt;/A&gt;, I can honestly say that he was one of the jackasses at Wikipedia that inspired me to give up the proverbial ghost when it came to quality control.  I was particularly annoyed at his penchant for moving forward with dramatic changes to an existing Wikipedia article without bothering to form a consensus.  And in all fairness to Fred, there are probably just as many people at Wikipedia who think that I'm a jackass.&lt;br /&gt;&lt;br /&gt;Ironically enough, the dispute that I had with Fred involved his revisions to the Wikipedia &lt;I&gt;Law&lt;/I&gt; article, and Fred is a retired lawyer, which should give him more than a passing familiarity with the topic.  And while I do not practice law, I was a Member of Law Review and a Teacher's Assistant for Legal Writing before graduating from UC Davis Law School.  As such, from a Larry Sanger-esque credentials standpoint, I think that I too would have been on pretty firm ground.  Even so, the most constructive input regarding the Wikipedia &lt;I&gt;Law&lt;/I&gt; article that came during my confrontation with Fred came from Wikipedian Slrubenstein (a very modest professor of anthropology), mysterious old-time Wikipedian SJK, and noted Wikipedian Lee Daniel Crocker (a computer programmer by trade who was the primary author of the current MediaWiki software).&lt;br /&gt;&lt;br /&gt;By the time I started contributing to Wikipedia in August of 2002, the writing was already on the wall regarding Larry Sanger's once prominent role as Wikipedia's editor in chief.  And having successfully cultivated more than one online community of several hundred members (i.e., &lt;A href="http://web.archive.org/web/20000816001109/www.wherewithal.com/odpland.html"&gt;Wherewithal&lt;/A&gt; and &lt;A href="http://web.archive.org/web/20010201203500/beta.projectnapa.com/"&gt;Project Napa&lt;/A&gt;) and then having had both of those communities taken away from me and destroyed by the people who hired me to cultivate those communities, I was inclined to feel sorry for Larry.  However, the more I came to understand Larry's feelings about what Wikipedia should be, the more I came to realize that Wikipedia was much better off without someone like Larry in charge.&lt;br /&gt;&lt;br /&gt;While Fred Bauder may or may not speak for the majority of old timers at Wikipedia, (IMHO) he is spot on in his criticism of Larry Sanger.  One need not look far for evidence of Larry's failings:  Nupedia failed because it favored credentialism and centralized control over content, and once Wikipedia emerged as a viable replacement for Nupedia, Sanger sought to impose credentialism and centralized content control at Wikipedia.  Even now, Larry hopes to use Citizendium to reassert the validity of credentialism and centralized content control as a panacea for what ails Wikipedia.&lt;br /&gt;&lt;br /&gt;I mentioned above that during my tenure at UC Davis Law School I was a Teacher's Assistant for Legal Writing and a Member of Law Review.  These positions gave me an enormous appreciation for just how difficult it is for most people to produce quality writing, just how defensive most people can be about really bad writing, and just how difficult it can be to reach a consensus on what good writing actually is when more than one editor's viewpoint must be appeased.  As a result of these harsh realities, the vast majority of promising law review candidates would wash out of law review before the end of their first semester, and the author of an article that qualified him or her for law review membership was seldom willing to claim the final product as his or her own.    &lt;br /&gt;&lt;br /&gt;Dynamics similar to those that I experienced at UC Davis Law Review are at work at Wikipedia, but with a rather strange twist:   Anybody can put in their own .02 in on a Wikipedia article, and quite a few people do.  To wit, while most Wikipedia articles end up having a relatively small number of regular and interested contributors, &lt;A href="http://www.aaronsw.com/weblog/whowriteswikipedia"&gt;the vast majority of content on Wikipedia is contributed by the rare occasional contributor&lt;/A&gt;.  What this means is that the occasional contributor is the key ingredient to Wikipedia's exponential growth but is not even a significant part of Wikipedia ongoing quality control problems.  Those problems are caused by biased and/or  uninformed experts and zealots -- i.e., jackasses.&lt;br /&gt;&lt;br /&gt;Rather than encourage the occasional contributor who has helped Wikipedia grow and prosper and discourage jackasses from asserting ownership over particular articles, Larry Sanger hopes to improve upon the Wikipedia model by excluding from Citizendium the people who provide the vast majority of useful content and forming an enclave of like-minded expert jackasses.  It's an interesting experiment, but -- given the enormous barriers to entry at Citizendium -- not one that I or any of the experts that I know are interested in joining.  And it's probably just as well:  Larry Sanger is apparently moderating all comments and trackbacks to the Citizendium blog, as all the comments appearing there harmonize with his point of view and &lt;A href="http://blog.citizendium.org/2007/02/25/wikipedia-article-defames-golf-pro-who-sues-company-with-ip-address-of-anonymous-contributor/"&gt;no trackback reference&lt;/A&gt; to my previous blog post has yet appeared.</description><link>http://blog.xodp.org/2007/02/dealing-with-jackasses-at-wikipedia.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117246363477968549</guid><pubDate>Mon, 26 Feb 2007 05:30:00 +0000</pubDate><atom:updated>2007-02-27T20:57:42.986-08:00</atom:updated><title>Larry Sanger Gets It Totally Wrong</title><description>While exploring the blogosphere for commentary about Wikipedia, my attention was drawn to &lt;a href="http://blog.citizendium.org/2007/02/25/wikipedia-article-defames-golf-pro-who-sues-company-with-ip-address-of-anonymous-contributor/"&gt;a post by Larry Sanger on the Citizendium Blog&lt;/a&gt;:&lt;blockquote&gt;&lt;I&gt;". . . Wikipedia's reach is now enormous, and if indeed it has gained a reputation, whether deserved or not, as a source of reasonably reliable information, and it defames someone for any significant length of time, such defamation can do very real harm to a person’s reputation. . . .&lt;BR&gt;&lt;BR&gt;"[ . . .]&lt;BR&gt;&lt;BR&gt;". . . Ethically, and probably legally, Wikipedia’s managers must face up to it, because the injustice the current situation perpetrates is completely and &lt;/I&gt;obviously&lt;I&gt; intolerable.&lt;/I&gt;&lt;/blockquote&gt;While Wikipedia may or may not have an ethical duty to prevent people from being defamed &lt;I&gt;&lt;B&gt;*on*&lt;/B&gt;&lt;/I&gt; Wikipedia, an issue on which reasonable minds may differ, the only thing that I find "obvious" on the issue of defamation &lt;I&gt;&lt;B&gt;*by*&lt;/B&gt;&lt;/I&gt; Wikipedia is that Larry Sanger knows absolutely nothing about defamation law.&lt;br /&gt;&lt;br /&gt;BTW, as I am wont to say when commenting on legal issues, I do not practice law, and nothing I have written in this post or elsewhere should be construed as a legal opinion or as legal advice.  That having been said, &lt;A href="http://www.msnbc.msn.com/id/17281907/"&gt;according to Attorney Scott D. Sheftall&lt;/A&gt;, the lawyer who is representing Fuzzy Zoeller in the recent defamation suit involving Wikipedia, the law is pretty clear when it comes to the fact that Wikipedia has no legal exposure here:&lt;blockquote&gt;&lt;I&gt;"Zoeller's attorney, Scott D. Sheftall, said he filed the lawsuit against a Miami firm last week because the law won't allow him to sue St. Petersburg-based Wikipedia."&lt;/I&gt;&lt;/blockquote&gt;The irony here is that Sanger's ongoing indictment of Wikipedia is that it allegedly fails to provide accurate information.  In Sanger's own words:&lt;blockquote&gt;&lt;I&gt;"Articles that contain defamatory remarks are not 'objectionable comments' on a 'message board'; they are presented to the world and typically accepted as fact, which is something Jimmy himself encourages by saying, as he does, that Wikipedia is actually pretty reliable."&lt;/I&gt;&lt;/blockquote&gt;This is a classic case of the pot calling the kettle black, and a pretty good case could be made that Sanger's comments about Jimbo Wales are defamatory:&lt;blockquote&gt;&lt;I&gt;"It is reprehensible, moreover, to react to this situation lackadaisically, regarding false claims that are personally and professionally damaging as in effect 'collateral damage' that society must bear as the price of having such a wonderful project as Wikipedia.  That reaction is reprehensible because it assumes that Wikipedia's managers cannot improve the way that it deals with this sort of problem.  They could, without completely breaking its system; but they choose not to."&lt;/I&gt;&lt;/blockquote&gt;According to &lt;A href="http://www.eff.org/bloggers/lg/faq-defamation.php"&gt;the guidelines offered by the Electronic Frontier Foundation in re defamation&lt;/A&gt;, an attorney could probably make a &lt;I&gt;prima facie&lt;/I&gt; case that Sanger is publishing untrue statements about Wales with the intention of impuning Wales' reputation.  However, given that Wales would probably qualify as a public figure, said attorney would also have to prove "actual malice" on the part of Sanger.  Nonetheless, I think it's incumbent upon Sanger to live up to the standards that he hopes to enforce:&lt;blockquote&gt;&lt;I&gt;"[Citizendium] will have a zero tolerance policy toward any even possibly defamatory remarks: to say something that might tend to impugn someone’s reputation, even if true, will require extremely good documentation.  If no such documentation is offered, or if it does not check out, the person who makes such claims will be "escorted to the door."  We simply won't tolerate it."&lt;/I&gt;&lt;/blockquote&gt;</description><link>http://blog.xodp.org/2007/02/larry-sanger-gets-it-totally-wrong.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117238003669672059</guid><pubDate>Mon, 26 Feb 2007 03:39:00 +0000</pubDate><atom:updated>2007-02-25T19:55:35.043-08:00</atom:updated><title>Powerset Redux</title><description>In &lt;A href="http://blog.xodp.org/2007/02/powerset-counts-coup-with-google.html"&gt;a previous XODP Blog post&lt;/A&gt;, I commented on the fact that &lt;A href="http://www.powerset.com"&gt;Powerset&lt;/A&gt; had succeeded where &lt;A href="http://www.google.com/"&gt;Google&lt;/A&gt; had failed in wooing natural language processing (NLP) guru &lt;A href="http://www2.parc.com/istl/members/kaplan/"&gt;Ronald Kaplan&lt;/A&gt; to spearhead its efforts at developing a semantic search engine.  Unlike most of the commentators that I've read on the blogosphere, I was willing to suspend judgment on Powerset's technology until it came out of stealth mode.  However, I remained titillated by the prospect of semantic search finally coming of age, so I did a little more digging, and I found some more information about &lt;a href="http://datamining.typepad.com/data_mining/2006/09/powerset_update.html"&gt;what Powerset is planning to debut&lt;/a&gt;:&lt;blockquote&gt;&lt;I&gt;"There are two key things here: the use of NLP and the disruption to the search interface. Finally, information retrieval will actually mean information retrieval, not document retrieval. One of the fundamental models of search that may be challenged . . . is . . . that search engines are designed to take people to pages. The more we can understand and summarize the information on those pages, the weaker this model becomes. . . ."&lt;/I&gt;&lt;/blockquote&gt;[The above summary comes from the blog of &lt;A href="http://www.nielsenbuzzmetrics.com/mgmnt_matthewhurst.asp"&gt;Matthew Hurst&lt;/A&gt;, a close friend of Powerset's CEO &lt;A href="http://www.barneypell.com/about.html"&gt;Barney Pell&lt;/A&gt;.]&lt;br /&gt;&lt;br /&gt;As promising as this vision of search is, it fails to take into account &lt;A href="http://earningmyturns.blogspot.com/2007/01/cost-of-search-computations.html"&gt;the cost of search computations&lt;/A&gt;:&lt;blockquote&gt;&lt;I&gt;"The computation for each query has quantifiable costs and benefits. The costs include R&amp;D, computing infrastructure amortization, energy, rent, maintenance. The direct benefit is advertising revenue from the query response; indirect benefits such as market share gained from greater search quality are harder to measure, but can still be estimated. Search engine success is ultimately given by the efficiency with which it delivers advertising revenue given these factors."&lt;/I&gt;&lt;/blockquote&gt;This sort of cost/benefit analysis gives rise to &lt;A href="http://blog.xodp.org/2005/11/what-lurks-in-googles-soul.html"&gt;an inherent conflict of interest in search engine technology&lt;/A&gt; as we know it.  To wit, if a search engine's natural search results are too good, there's no incentive to click on paid search results, and the business model breaks down.  At the other extreme is an information resource like &lt;A href="http://www.wikipedia.org/"&gt;Wikipedia&lt;/A&gt;, with a business model that is based primarily upon a &lt;A href="http://www.firstmonday.org/issues/issue3_12/barbrook/"&gt;gifting economy&lt;/A&gt;.  That sort of business model also breaks down when the quality of information it provides gets too good, as it attracts all sorts of people whose primary interest is in &lt;A href="http://www.time.com/time/magazine/article/0,9171,1590440,00.html"&gt;getting rich off of those who work for free&lt;/A&gt;.&lt;br /&gt;&lt;br /&gt;To quote the late Keith Moon, "Sometimes you do alright to steer clear of quality."  (A quote that I found &lt;A href="http://bigpirate.blogspot.com/2005/07/focus-part-iii.html"&gt;misattributed to Pete Townshend&lt;/A&gt;.) Nowhere is this more true than in it is when it comes to technology, where &lt;A href="http://tech.groups.yahoo.com/group/xodp/message/2259"&gt;a successful disruptive technology (i.e,. blogs)&lt;/A&gt; usually starts out as an inferior technology.  Assuming, &lt;I&gt;arguendo&lt;/I&gt;, that Powerset's technology truly is better than Google's, this is the real challenge that Powerset's evangelists will have to address.</description><link>http://blog.xodp.org/2007/02/powerset-redux.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117212508721282816</guid><pubDate>Sat, 24 Feb 2007 21:48:00 +0000</pubDate><atom:updated>2007-02-24T15:08:00.293-08:00</atom:updated><title>Rumors of Wikipedia's Imminent Demise Are Greatly Exagerrated</title><description>Following up on some recent XODP Blog posts about Wikipedia, I noticed that it has become somewhat fashionable to predict Wikipedia's imminent demise, the most recent example of this that I found being Peter Da Vanzo's post at the v7n blog, &lt;a href="http://blog.v7n.com/2007/02/21/have-dmoz-taken-over-wikipedia/"&gt;Have DMOZ Taken Over Wikipedia?&lt;/a&gt;&lt;blockquote&gt;&lt;I&gt;"DMOZ and Wikipedia share much in common - other than Wikipedia isn't completely useless. Yet.&lt;BR&gt;&lt;BR&gt;"[ . . . ]&lt;BR&gt;&lt;BR&gt;. . . You build something that looks open, and appears to be open, but in reality, is locked up tight, and run by a small group of people making ever more insular decisions.&lt;BR&gt;&lt;BR&gt;". . . [T]hey're under-resourced for the task, and as the task grows, the more under-resourced they become. In response, they compromise the very thing that made them valuable - accessibility."&lt;/I&gt;&lt;/blockquote&gt;I've had recurring issues with Wikipedia since I first started contributing to it back in August of 2002, and I think that some project forking is long overdue.  However, I think that Da Vanzo's indictment of Wikipedia is gratuitiously harsh.  Ditto for the &lt;A href="http://www.roughtype.com/archives/2006/05/the_death_of_wi.php"&gt;indictment of Wikipedia proffered by Nicholas Carr&lt;/A&gt; back in May of 2006:&lt;blockquote&gt;&lt;I&gt;"Wikipedia, the encyclopedia that "anyone can edit," was a nice experiment in the "democratization" of publishing, but it didn't quite work out. Wikipedia is dead. It died the way the pure products of idealism always do, slowly and quietly and largely in secret, through the corrosive process of compromise.&lt;BR&gt;&lt;BR&gt;". . . A few months ago, in the wake of controversies about the quality and reliability of the free encyclopedia's content, the Wikipedian powers-that-be - . . .  tightened the restrictions on editing. In addition to banning some contributors from the site, the administrators adopted an "official policy" of what they called, in good Orwellian fashion, "semi-protection" to prevent "vandals" (also known as people) from messing with their open encyclopedia."&lt;/I&gt;&lt;/blockquote&gt;A much more muted prediction of Wikipedia's demise was offered by Eric Goldman on his Technology and Marketing Law Blog entitled &lt;A href="http://blog.ericgoldman.org/archives/2006/12/wikipedia_will_1.htm"&gt;Wikipedia Will Fail in Four Years&lt;/A&gt;:&lt;blockquote&gt;&lt;I&gt;". . . I'm . . . basing this prediction on the experiences of ODP. I think it's fair to say that (1) in its heyday, the ODP did an amazing job of aggregating free labor to produce a valuable database, and (2) the ODP is now effectively worthless."&lt;/I&gt;&lt;/blockquote&gt;While I agree that Wikipedia has all sorts of limitations and faces all sorts of challenges, I also think that predicting Wikipedia's demise based on comparisons to ODP/dMOZ is very simplistic thinking.&lt;br /&gt;&lt;br /&gt;The single biggest problem with the purportedly Open Directory Project was that &lt;A href="http://www.traffick.com/article.asp?aID=62"&gt;it was never, ever truly open&lt;/A&gt;.  A close second was the fact that ODP never had a business plan.  Neither of these things can be said about Wikipedia.  Rather, &lt;A href="http://blog.xodp.org/2006/08/wikipedia-and-semantic-web.html"&gt;the single biggest problem with Wikipedia is quality control&lt;/A&gt;, and a close second is &lt;A href="http://meta.wikimedia.org/wiki/Regime_change"&gt;the bizarre bureaucracy that has been slowy emerging&lt;/A&gt; in response to the problem of quality control.  The fallback solution to both of these problems and most of the other problems that open projects encounter is project forking, something that could not happen with ODP because of its corporate ownership, its &lt;A href="http://www.rainwaterreptileranch.org/steve/sw/odp/rdflist.html#15"&gt;onerous licensing restrictions&lt;/A&gt;, and its failure and refusal to use an open source software platform.</description><link>http://blog.xodp.org/2007/02/rumors-of-wikipedias-imminent-demise.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117211258892244469</guid><pubDate>Thu, 22 Feb 2007 20:24:00 +0000</pubDate><atom:updated>2007-02-22T12:28:31.810-08:00</atom:updated><title>Grokking the Semantic Web</title><description>Although I've written about &lt;A href="http://blog.xodp.org/2006/08/what-is-semantic-web.html"&gt;what the Semantic Web is&lt;/A&gt; previously, I've always felt that there should be a simpler way to explain it.  And while exploring the blogosphere, I found &lt;a href="http://moresemantic.blogspot.com/2006/11/wikipedia-to-serve-as-global-ontology.html"&gt;a blog post on ....more semantic!&lt;/A&gt; that finally did just that by referencing Wikipedia as a global ontology:&lt;blockquote&gt;&lt;I&gt;"[Y]ou could . . . use a wikipedia reference to indicate the semantic concept that you are writing about.  Thus, . . . you could use the link http://en.wikipedia.org/wiki/Rome to indicate that you are refering to the city of Rome, the capital of Italy."&lt;/I&gt;&lt;/blockquote&gt;This simple statement sums up just how straightforward it would be for someone to publish a document on the World Wide Web that is compatible with the Semantic Web.  It also makes it abundantly clear just how tedious and time consuming such a task would be.&lt;br /&gt;&lt;br /&gt;Most people who describe the Semantic Web talk about making web documents easier for machines to process.  That's true enough, but that's really not what the Semantic Web is all about.  At its essence, the Semantic Web is all about disambiguation, &lt;A href="http://blog.xodp.org/2006/08/keywords-semantics-and-disambiguation.html"&gt;something that Wikipedia does quite well&lt;/A&gt;, even breaking through language barriers in the process.&lt;br /&gt;&lt;br /&gt;The other day, a friend of mine who is taking a conversational Spanish class asked me if I could help her find something that was written in Spanish to complete an assignment for her class.  I immediately thought of Wikipedia and how it links from articles in one language to articles in another language, so I told my friend to find a Wikipedia article in English that she liked and follow the link to the Spanish Wikipedia.  I then directed her to use &lt;A href="http://babelfish.altavista.com/"&gt;Babelfish&lt;/A&gt; for a crude translation of the article on the Spanish Wikipedia.&lt;br /&gt;&lt;br /&gt;The Wikipedia article that my friend picked was &lt;I&gt;Cat&lt;/I&gt;.  However, when I followed the link to the corresponding article on the Spanish Wikipedia, I discovered that the two articles were quite different, so I found the appropriate article on the Spanish Wikipedia and changed the outgoing link on the English Wikipedia.  Prior to the change I made, the English Wikipedia considered &lt;I&gt;Cat&lt;/I&gt; to be a synonym for &lt;I&gt;House cat&lt;/I&gt;, but linked to the more generic Spanish Wikipedia article entitled &lt;I&gt;Felis&lt;/I&gt; rather than the more specific article entitled &lt;I&gt;Gato doméstico&lt;/I&gt;, which in turn redirected to &lt;I&gt;Felis silvestris catus&lt;/I&gt;.&lt;br /&gt;&lt;br /&gt;This simple exercise demonstrates how a semantic search engine algorithm could drastically improve the relevancy of keyword-based search results, something that I hinted at in my previous XODP Blog post entitled &lt;A href="http://blog.xodp.org/2007/02/wordnet-disambiguation-and-semantic.html"&gt;Wordnet, Disambiguation, and the Semantic Web&lt;/A&gt;.  And contrary to what some have suggested, a semantic search engine would not need to be intimately familiar with a user and/or the context of a particular user's search.  It could default &lt;A href="http://en.wikipedia.org/wiki/Statistical_semantics"&gt;statistically&lt;/A&gt; to the most likely meaning of a particular word occurrence and still allow an end user to provide feedback on what he or she really meant.&lt;br /&gt;&lt;br /&gt;All of this begs (or rather raises) the question of whether people need or want a user agent that is this sophisticated when it comes to searching the Web.  I've done a significant amount of end user training, both for my private clients (most of whom are lawyers) and during &lt;A href="http://www.lorman.com/bookstore/bookstore_details.php?sku=359509"&gt;presentations that I've made at Mandatory Continuing Legal Education (MCLE) seminars&lt;/A&gt;, and few people ever tax my knowledge base with their questions.  In fact, I got my best reviews (5 out of a possible 5 for 90 percent of the queries asked of seminar attendees) by taking a full 30 minutes to explain the anatomy of hypertext links, knowledge that most Web-savvy individuals take for granted.  Thus, I am inclined to believe that both the Semantic Web and semantic search will remain solutions looking for a problem for the foreseeable future; at best, they may become solutions made by geeks and for geeks.</description><link>http://blog.xodp.org/2007/02/grokking-semantic-web.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117208991224334306</guid><pubDate>Wed, 21 Feb 2007 23:15:00 +0000</pubDate><atom:updated>2007-02-22T04:44:59.393-08:00</atom:updated><title>Linking Wikipedia Articles to ODP/dMOZ Categories</title><description>While following up on a previous XODP Blog post &lt;A href="http://blog.xodp.org/2007/02/odpdmoz-in-memoriam-once-again.html"&gt;narrating the death and resurrection of ODP&lt;/A&gt;, I found a link in &lt;A href="http://www.texttechnologies.com/2007/02/07/dmoz-cure-wikipedia-spam-problem"&gt;a post on the Text Technologies blog&lt;/A&gt; pointing to a post on Joost de Valk's blog entitled &lt;a href="http://www.joostdevalk.nl/blog/dmoz-and-wikipedia-how-it-should-work/"&gt;DMOZ and Wikipedia: how it should work&lt;/a&gt;:&lt;blockquote&gt;&lt;I&gt;"Wikipedia is an online encyclopedia. . . . An encyclopedia should contain references to other articles in that encyclopedia, but doesn’t nescessarily have to have external links. A directory’s major purpose (at least the ODP’s major purpose) is to contain external links, chosen by editors to be of high quality.&lt;BR&gt;&lt;BR&gt;"[ . . . ]&lt;BR&gt;&lt;BR&gt;"On some articles, editors have linked to DMOZ, and removed almost all external links. That, to me, looks very good. The links in DMOZ have been checked, and should be of high value, in Wikipedia, this is impossible to do, as anyone can add his [or her] own link."&lt;/I&gt;&lt;/blockquote&gt;A truly open Web directory would be a great companion to &lt;A href="http://www.wikipedia.org/"&gt;Wikipedia&lt;/A&gt;, but linking a Wikipedia article to &lt;A href="http://dmoz.org/"&gt;ODP&lt;/A&gt; and eliminating all other outbound links from that article will create many more problems than it will solve, &lt;I&gt;a fortiori&lt;/I&gt; when one considers the uncertain future of ODP.&lt;br /&gt;&lt;br /&gt;Astonishingly, no one seems to be pointing out just how bad this idea really is, much less why.  When it comes to quality control, ODP's track record is much worse than Wikipedia's, so much so that both &lt;A href="http://blogs.msdn.com/livesearch/archive/2006/05/22/603917.aspx"&gt;MSN&lt;/A&gt; and &lt;A href="http://sitemaps.blogspot.com/2006/07/more-control-over-page-snippets.html"&gt;Google&lt;/A&gt; now allow webmasters to opt out of using ODP's meta data for their site descriptions, whereas Yahoo! has its own human editors to provide site descriptions.  Add to this ODP's historical backlog of hundred of thousands (perhaps millions?) of sites, with a typical wait of some six months or more for a site submission to be reviewed, along with the total lack of transparency in the purportedly &lt;U&gt;&lt;B&gt;&lt;I&gt;Open&lt;/U&gt;&lt;/B&gt;&lt;/I&gt; Directory Project, and Joost's idea makes about as much sense as amending the United States Constitution so that the current President Bush can be elected for a third term and stay the course in Iraq.&lt;br /&gt;&lt;br /&gt;Assuming, &lt;I&gt;arguendo&lt;/I&gt;, that ODP did not have systemic issues with quality control, was not a black hole for site submissions, and actually was an &lt;I&gt;&lt;B&gt;&lt;U&gt;open&lt;/U&gt;&lt;/B&gt;&lt;/I&gt; directory, the idea of removing relevant outbound links from Wikipedia and replacing them with one link to a purportedly authoritative and comprehensive Web directory would be bad enough all by itself.  One of the best things about Wikipedia is that it is &lt;A href="http://blog.xodp.org/2006/08/keywords-semantics-and-disambiguation.html"&gt;a centralized clearinghouse for information on keyword-based topics&lt;/A&gt;, and a list of relevant outbound links is a logical component of such an online reference.  To this end, many (if not most) Wikipedia articles include easily verifiable citations that link to other websites.  Consequently, I find it hard to believe that more than a handful of Wikipedians would take Joost's suggestion seriously, but well over a thousand Wikipedia articles &lt;A href="http://en.wikipedia.org/wiki/Special:Whatlinkshere/Template:Dmoz"&gt;currently use a DMOZ link template&lt;/A&gt;, and &lt;A href="http://en.wikipedia.org/wiki/Wikipedia:External_links"&gt;Wikipedia's external links guideline&lt;/A&gt; was revised on November 16, 2006 to make the use of this template the norm.  Notwithstanding the six week outage of ODP's website, this recommendation remains the &lt;I&gt;status quo&lt;/I&gt;.</description><link>http://blog.xodp.org/2007/02/linking-wikipedia-articles-to-odpdmoz.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117207723797784678</guid><pubDate>Wed, 21 Feb 2007 19:13:00 +0000</pubDate><atom:updated>2007-02-21T11:19:29.630-08:00</atom:updated><title>ODP/dMOZ - In Memoriam, Once Again</title><description>While I occasionally mention ODP/dMOZ in a historical context, particularly so in reference to &lt;A href="http://blog.xodp.org/2007/01/promise-of-wikiseek.html"&gt;Wikipedia inheriting whatever creative genius ODP once had&lt;/A&gt;, I wrote off ODP quite some time ago.  Indeed, I had almost written off Web directories altogether until I stumbled upon Robert Barger and Brian Prince at Search Engine Strategies 2005 and discovered that they and others like them had successfully &lt;A href="http://tech.groups.yahoo.com/group/xodp/message/2398"&gt;reinvented the concept of indexing communities&lt;/A&gt;.  Even so, when enough people find some meat on ODP's rotting horse carcass, I write yet another post-mortem on the topic.&lt;br /&gt;&lt;br /&gt;Notwithstanding a dismissive post that I wrote on the XODP Blog on June 19, 2006 entitled &lt;A href="http://blog.xodp.org/2006/06/will-odp-ever-die.html"&gt;Will ODP Ever Die?&lt;/A&gt;, my last extensive &lt;A href="http://blog.xodp.org/2006/03/post-mortem-on-odp.html"&gt;post-mortem on ODP&lt;/A&gt; was posted on the XODP Blog on March 23, 2006, just under a year ago, so I guess it's about time for yet another one.  To this end, Robozilla posted &lt;A href="http://tech.groups.yahoo.com/group/xodp/message/2481"&gt;a brief message on the XODP Yahoo! eGroup&lt;/A&gt; pointing to a post on Richard Skrenta's blog from December 16, 2006 entitled, &lt;a href="http://www.skrenta.com/2006/12/dmoz_had_9_lives_used_up_yet.html"&gt;DMOZ had 9 lives. Used up yet?&lt;/a&gt;.  I'm sorry that I'm so late in getting to the party, but better late than never.  In any event, the horse has been dead for quite some time and has been hoisted up on a rope and hit so many times that it's beginning to resemble a piñata, the difference being that there's no candy on the inside of this beast.&lt;br /&gt;&lt;br /&gt;For those of you who are unaware of the recent drama over at ODP, here's a pithy excerpt from Skrenta's blog:&lt;blockquote&gt;&lt;I&gt;"Apparently the machine holding dmoz in AOL ops crashed. Standard backups had been discontinued for some reason; during unsuccessful attempts to restore some of the lost data, ops blew away the rest of the existing data on the system.&lt;BR&gt;&lt;BR&gt;"So for the past 6 weeks, a few folks have been trying to patch the system back together again (reverse engineering from the latest RDF dump, I suppose). &lt;BR&gt;&lt;BR&gt;"dmoz doesn't exactly operate on a model of transparency, to say the least, so they have been keeping the details of what happened private. . . ."&lt;/I&gt;&lt;/blockquote&gt;I first became aware of this drama when the ODP article at Wikipedia (which is still on my Wikipedia watchlist) started to see &lt;A href="http://en.wikipedia.org/w/index.php?title=Open_Directory_Project&amp;diff=83061797&amp;oldid=82951181"&gt;some really weird edits&lt;/A&gt;.  Given what I know of the inner workings of AOL, I gave ODP about a 50/50 chance of disappearing from AOL, the way that Disney's &lt;A href="http://tech.groups.yahoo.com/group/xodp/message/610"&gt;Go Directory&lt;/A&gt; and Looksmart's &lt;A href="http://blog.xodp.org/2006/03/post-mortem-on-zeal.html"&gt;Zeal directory&lt;/A&gt; disappeared, with the ODP editor community reconstituting itself in one form or another on one or more websites using one of the not-so-recent ODP RDF dumps.  I also anticipated that Richard Skrenta and Jimbo Wales would step up to the plate and offer whatever assistance they could.  But for the fact that someone over at AOL took notice of Skrenta's blog post, that's what would have happened.  As it now stands, ODP's systems have been more or less restored to the &lt;I&gt;status quo ante&lt;/I&gt;, for better or for worse.&lt;br /&gt;&lt;br /&gt;With the exception of a very small handful of true believers, most of the people who have ever had any dealings with ODP are either very apathetic about it (i.e., me) or have a very negative opinion of it.  &lt;A href="http://seoblog.intrapromote.com/2006/12/dmoz_please_die.html"&gt;Many people would like to see ODP completely extinguished&lt;/A&gt;, but the ongoing ODP RDF dumps and resulting ODP clones that currently populate the Web make that an impossibility.  The best case scenario is one where a white knight like Jimbo Wales would be allowed to give the current ODP a new home.&lt;br /&gt;&lt;br /&gt;More than one person with the resources to make it happen (not Jimbo Wales) has asked me if I would be interested in spearheading an effort to reorganize ODP, the last time being about a year and a half ago.  And my answer has always been, "Sure, for the right price.  However, if you have those sort of financial resources, there's quite a few other projects that I consider more worthwhile."  And that really is the problem with ODP.  With the rare exception of someone like Jimbo Wales, the only people with the wherewithal to salvage what's left of ODP are either totally disinterested, totally clueless, totally in denial, or totally corrupt.  Moreover, while AOL does not seem to be interested in revitalizing ODP, &lt;A href="http://sorvoja.com/2006/12/13/will-aol-allow-a-open-directory-foundation"&gt;it is dead set on making sure that no one takes it away from them&lt;/A&gt;.</description><link>http://blog.xodp.org/2007/02/odpdmoz-in-memoriam-once-again.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117199081114352359</guid><pubDate>Tue, 20 Feb 2007 18:21:00 +0000</pubDate><atom:updated>2007-02-20T10:23:35.150-08:00</atom:updated><title>WordNet Redux</title><description>In &lt;A href="http://blog.xodp.org/2007/02/wordnet-disambiguation-and-semantic.html"&gt;a recent XODP Blog post&lt;/A&gt;, I extolled the virtues of &lt;A href="http://wordnet.princeton.edu/"&gt;WordNet&lt;/A&gt; and declared that it was one of the best kept secrets when it comes to Natural Language Processing (NLP) resources.  Following up on this post, I found that few people outside of the NLP arena seem to have even heard of WordNet, and even the experts in this field do not seem to appreciate WordNet's potential.  For instance, in &lt;A href="http://artificial-artificial-intelligence.com/index.php?title=wordnet&amp;amp;more=1&amp;amp;c=1&amp;amp;tb=1&amp;amp;pb=1"&gt;a post at the Artificial Artifical Intelligence Blog&lt;/A&gt;, Lukas Biewald laments:&lt;blockquote&gt;&lt;I&gt;"Are concepts really a hierarchy? I’ve heard cognitive scientists think so, but I disagree. And I think that trying to make all the concepts conform to this artificial hierarchal structure has turned WordNet into a much less useful resource.&lt;BR&gt;&lt;BR&gt;"[ . . .]&lt;BR&gt;&lt;BR&gt;". . . [Some] groups of concepts . . . actually have a hierarchical structure for an unrelated real-world reason. . . .&lt;BR&gt;&lt;BR&gt;"[ . . .]&lt;BR&gt;&lt;BR&gt;"But this hierarchy completely breaks down for more conceptual things. Is respect in the sense of 'respect for my Father,' a type of 'attitude'' or 'politeness' or 'filial duty' or 'affection?' Clearly it’s all these things. But the guys making WordNet didn’t want to believe that, so they make respect as a type of attitude one semantic category, and respect as a type of politeness another, and so on, until there are ten separate senses for respect."&lt;/I&gt;&lt;/blockquote&gt;In their book, &lt;I&gt;Computational Linguistics&lt;/I&gt;, which I mentioned in my previous post about WordNet, Igor Bolshakov and Alexander Gelbukh anticipate these sort of objections to creating linear representations (i.e., &lt;A href="http://www.gelbukh.com/clbook/Computational-Linguistics.htm#_Toc86751668"&gt;Text&lt;/A&gt;) of non-linear entities (i.e., Meaning):&lt;blockquote&gt;&lt;I&gt;". . . The human had to be satisfied with the instrument of speech given to him by nature.  This is why we use while speaking a linear and rather slow method of acoustic coding of the information we want to communicate to someone else.&lt;BR&gt;&lt;BR&gt;"[ . . . ]&lt;BR&gt;&lt;BR&gt;"While the information contained in the text can have a very complicated structure, with many relationships between its elements, the text itself has always one-dimensional, linear nature, given letter by letter. . . . [A] text represents non-linear information transformed into linear form.  What is more, the human cannot represent in usual texts even the restricted non-linear elements of spoken language, namely, intonation and logical stress. . . .&lt;BR&gt;&lt;BR&gt;". . . A text consists of elementary pieces having their own, usually rather elementary, meaning.  This meaning is determined by the meaning of each one of their components, though not always in a straightforward way.  These structures are organized in even larger structures like sentences, etc. . . . Such organization provides linguistics with the means to develop the methods of intelligent text processing."&lt;/I&gt;&lt;/blockquote&gt;Like Wikipedia and its ongoing efforts at &lt;A href="http://en.wikipedia.org/wiki/Wikipedia:Disambiguation"&gt;disambiguation&lt;/A&gt;, WordNet cannot be all things to all people, nor should it try, notwithstanding Lukas Biewald's assertion that WordNet should make allowances for words meaning "all of the above."  What WordNet can be, is, and should remain, is a srong foundation for more sophisticated semantic analysis.</description><link>http://blog.xodp.org/2007/02/wordnet-redux.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117192125005213536</guid><pubDate>Tue, 20 Feb 2007 00:28:00 +0000</pubDate><atom:updated>2007-02-19T16:31:31.086-08:00</atom:updated><title>WordNet, Disambiguation, and the Semantic Web</title><description>While following up on &lt;A href="http://blog.xodp.org/2007/02/powerset-counts-coup-with-google.html"&gt;a previous XODP Blog post&lt;/A&gt; that covered Powerset's attempts to outdo Google by using Natural Language Processing (NLP), I discovered a book entitled &lt;A href="http://www.gelbukh.com/clbook/"&gt;Computational Linguistics&lt;/A&gt;, available for free in both HTML and PDF format.  As someone who has a background in both linguistics and cognitive science, I found the book a fascinating read, and about a third of the way through it, I discovered Princeton's &lt;a href="http://wordnet.princeton.edu/"&gt;WordNet&lt;/a&gt;.  While WordNet is anything but obscure, as NLP resources go, I thnk WordNet is one of the world's best kept secrets.&lt;br /&gt;&lt;br /&gt;A while back &lt;A href="http://blog.xodp.org/2006/08/keywords-semantics-and-disambiguation.html"&gt;I heaped praised on Wikipedia for its ability to disambiguate keyword-based queries&lt;/A&gt;.  Without taking anything away from Wikipedia's ongoing efforts at disambiguation, WordNet is already the most comprehensive open source and open content online resource for disambiguation of English nouns, verbs, and adjectives, providing extensive semantic analysis for just under 150,000 unique word strings.  Noticeably missing from WordNet's database are articles, prepositions, pronouns, conjunctions, and word particles.  Moreover, WordNet does not provide any information about any word's etymology or pronunciation.  However, Wordnet does provide operational definitions for wordstrings, information about their common usage, and comprehensive semantic grouping information, including synonyms, antonyms, hyponyms, hypernyms, meronyms, holonyms, and troponyms.  And if you want to know what any of those words mean, I suggest that you go on over to WordNet and enter them into the online interface.&lt;br /&gt;&lt;br /&gt;WordNet has more or less solved one of the most basic challenges that was facing developers of &lt;A href="http://blog.xodp.org/2006/08/what-is-semantic-web.html"&gt;the Semantic Web&lt;/A&gt;, as WordNet provides a relatively comprehensive operational lexical ontology for the English language where the taxonomy is both clear and definite, but still very, very flexible.  In lieu of a standard search engine algorithm, which is limited to determining what I call "keyword relevancy," a properly configured semantic web user agent would be able to determine the actual semantic relevancy of online resources.  Moreover, the end user would not have to resort to a series of keyword-based searches.  Rather, an end user would have an ongoing conversation with his or her user agent, providing more and more feedback on what said end user wanted to know.&lt;br /&gt;&lt;br /&gt;The Semantic Web envisioned by Tim Berners-Lee suffers from the perception that it's a high falutin enterprise and is commonly referred to by most web developers as the Pedantic Web.  This ivory tower disconnect could easily be remedied by creating user-friendly user agents.  In time, casual end users would be willing to use a semantic search agent when they encountered problems finding useful online resources with a standard search engine.  Meanwhile, people who want to use the Web for serious research would be able to carve out a semantic niche for themselves.</description><link>http://blog.xodp.org/2007/02/wordnet-disambiguation-and-semantic.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117184800796730014</guid><pubDate>Mon, 19 Feb 2007 03:24:00 +0000</pubDate><atom:updated>2007-02-18T19:27:19.430-08:00</atom:updated><title>Powerset Counts Coup with Google</title><description>A &lt;a href="http://battellemedia.com/archives/003350.php"&gt;post on John Battelle's Searchblog&lt;/a&gt; points to &lt;A href="http://venturebeat.com/2007/02/08/powersets-search-technology-scoop-may-scare-google/"&gt;an article on VentureBeat&lt;/A&gt;, the latter being journalist Matt Marshall's recently launched online publication that covers developments in the venture capital world.&lt;blockquote&gt;&lt;I&gt;"Powerset, a San Francisco search engine company, will announce Friday it has won exclusive rights to significant search engine technology it says may help propel it past Google.&lt;BR&gt;&lt;BR&gt;"The technology, developed at Palo Alto Research Center (PARC) in Silicon Valley, seeks to understand the meanings between words, akin to the way humans understand language — and is thus called 'natural language. . . .'&lt;BR&gt;&lt;BR&gt;"The deal is significant because practical use of linguistic technology has eluded Google. . . .&lt;BR&gt;&lt;BR&gt;"[ . . . ]&lt;BR&gt;&lt;BR&gt;" . . . Powerset could possibly steal a lead if it improves search results by a significant measure with natural language and simultaneously incorporates a near-equivalent to Google’s existing capabilities. Powerset has been hiring lots of Yahoo search experts and others, to help it do that.&lt;BR&gt;&lt;BR&gt;"[ . . . ]&lt;BR&gt;&lt;BR&gt;". . . Negotiations on the deal, just completed, were so secretive that Powerset’s executives hid a Xerox PARC scientist, Ron Kaplan, in a back room when VentureBeat stopped by for an interview last year. Kaplan, who has led the 'natural language' group for several years, joined Powerset as chief technology officer in July. This is a coup for Powerset, because Kaplan did not respond to some early probes from Google. In an interview, Kaplan said he didn't believe Google took natural language seriously enough. 'Deep analysis is not what they’re doing. Their orientation is toward shallow relevance, and they do it well.' Powerset, however, 'is much deeper, much more exciting. It really is the whole kit and caboodle.' While natural language has been a vexing problem for decades, Kaplan said he believes it is ready for prime-time."&lt;/I&gt;&lt;/blockquote&gt;The fact that a heavy-hitter like Kaplan chose Powerset over Google is quite remarkable, as Google has been responsible for a brain drain in the search engine space for quite some time.  Of course, the technology is being hyped, site unseen, and as pointed out by Matt at Venturebeat, it also remains to be seen whether people can be convinced to change their keyword-based search behavior.</description><link>http://blog.xodp.org/2007/02/powerset-counts-coup-with-google.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117172244915836752</guid><pubDate>Mon, 19 Feb 2007 00:01:00 +0000</pubDate><atom:updated>2007-02-18T16:03:50.916-08:00</atom:updated><title>Wikipedia's Feckless Attempts to Fight Link Spam</title><description>Although it's hardly breaking news, I recently discovered that &lt;a href="http://lists.wikimedia.org/pipermail/wikien-l/2007-January/061137.html"&gt;Wikipedia developer Brion Vibber has reinstated nofollow attributes for outbound links on Wikipedia articles&lt;/a&gt;.  Brion took this action at the request of Wikipedia founder and God-king Jimbo Wales, but Brion also mentioned that he had heard a rumor that a search engine optimization (SEO) contest threatened to overwhelm Wikipedia with spam.  According to Brion, &lt;A href="http://lists.wikimedia.org/pipermail/wikien-l/2007-January/061371.html"&gt;the change will be for the indefinite future&lt;/A&gt;:&lt;blockquote&gt;&lt;I&gt;"I'd prefer to see actual improvements (whitelisting, fading, flagging and approval system, etc) rather than just turn it off one day, though."&lt;/I&gt;&lt;/blockquote&gt;Hard on the heels of this policy change, most members of the SEO community started crying foul, even as a significant number of Wikipedians started chanting, "Ding, dong, the witch is dead."  Both of these responses are what I would have expected.  However, when the nofollow feature was first put to a vote on Wikipedia some time ago, &lt;A href="http://en.wikipedia.org/wiki/Wikipedia:Nofollow"&gt;a majority of Wikipedians voted to turn off the nofollow attribute&lt;/A&gt; on Wikimedia software, returning Wikipedia to the &lt;I&gt;status quo ante&lt;/I&gt;.&lt;br /&gt;&lt;br /&gt;Even with the nofollow atribute enabled on Wikimedia software, Wikipedia remains an attractive nuisance for link spammers, as Wikipedia is currently ranked 12th on Alexa for overall web traffic.  Assuming that the nofollow attribute does what it purports to do, all the current Wikipedia nofollow policy does is make sure that a search engine like Google does not count links on Wikipedia as votes in favor of increasing a particular URL's Google PageRank.  While this may have a phenomenal impact on search engine rankings and results over the next month or two, it will have little to no impact on Wikipedia link spam, at least not in the foreseeable future.&lt;br /&gt;&lt;br /&gt;Simply put, Wikipedia's nofollow policy makes about as much sense as screen doors on a submarine.  While the latter may keep the fish out, the floodgates will remain open, and the shameful joy of the Wikipedians who think that Jimbo has struck a blow for truth, justice, and the American way is sounding more and more like &lt;A href="http://lists.wikimedia.org/pipermail/wikien-l/2007-January/061426.html"&gt;hollow and self-aggrandizing rhetoric:&lt;/A&gt;&lt;blockquote&gt;&lt;I&gt;"We are not a links directory which is one of the problems with various ideas to selectively turn off no follow[sic]. If we do that we are basicaly[sic] admiting[sic] we are a links directory and we would gain very little for doing so."&lt;/I&gt;&lt;/blockquote&gt;Notwithstanding claims to the contrary, Wikipedia is a trusted source of URLs, not unlike a links directory.  Moreover, it's pretty clear that link spam is already a serious problem for Wikipedia, and that the problem is only going to get much worse for the foreseeable future, prompting me to revise what I asserted in &lt;A href="http://blog.xodp.org/2007/02/what-are-wikipedias-limitations.html"&gt;a previous blog post&lt;/A&gt; that the first practical limitation that Wikipedia will encounter will be in its ability to attract additional contributors.  Indeed, the problem that Wikipedia now has is that it is attracting too many contributors with a hidden agenda.</description><link>http://blog.xodp.org/2007/02/wikipedias-feckless-attempts-to-fight.html</link><author>Internet Esquire</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-16014280.post-117171195914983363</guid><pubDate>Sat, 17 Feb 2007 12:49:00 +0000</pubDate><atom:updated>2007-02-17T04:52:02.023-08:00</atom:updated><title>What Are Wikipedia's Limitations?</title><description>In a &lt;A href="http://blog.xodp.org/2007/01/promise-of-wikiseek.html"&gt;previous XODP Blog post&lt;/A&gt;, I stated my belief that whatever creative and progressive genius ODP/dMOZ once had has been inherited by Wikipedia.  Following up on this belief, I considered the fact that &lt;A href="http://en.wikipedia.org/wiki/Wikipedia:Size_comparisons"&gt;the number of articles on Wikipedia&lt;/A&gt; rivals the number of URLs indexed in the ODP database.  And while my cursory review of &lt;A href="http://en.wikipedia.org/wiki/Wikipedia:Statistics"&gt;Wikipedia's statistics&lt;/A&gt; failed to determine exactly how many outbound links Wikipedia has, I think it's safe to say that the number of outbound links in Wikipedia articles is already much larger than &lt;A href="http://en.wikipedia.org/wiki/Wikipedia:Size_comparisons#Internet"&gt;the number of URLs indexed in the ODP database&lt;/A&gt;.  This prompts me to ask:  What are Wikipedia's limitations?&lt;br /&gt;&lt;br /&gt;Given the highly decentralized nature of Wikipedia editing practices and the scalability of Wikipedia's open source platform, there are virtually no internal limitations to Wikipedia's potential for growth, whether it be growth in the sheer amount of content, growth in the number of articles, growth in the number of outbound links, or growth in the number of contributors.  Rather, Wikipedia has a demonstrated potential for &lt;A href="http://en.wikipedia.org/wiki/Wikipedia:Modelling_Wikipedia%27s_growth"&gt;exponential growth&lt;/A&gt;, and its practical limitations are found in the potential number of topics to be covered, the amount of outside content to be indexed, and the number of people who are willing to contribute their time and talents to Wikipedia.  A derivative limitation is quality control, as &lt;A href="http://blog.xodp.org/2007/01/wikipedias-bias-against-experts.html"&gt;Wikipedia's so-called "conflict of interest" policy&lt;/A&gt; scares away many subject matter experts.&lt;br /&gt;&lt;br /&gt;Of the many growth metrics one can measure, the only one that indicates Wikipedia's growth is slowing down is &lt;A href="http://meta.wikimedia.org/wiki/Wikipedia.org_is_more_popular_than..."&gt;its prominence on Alexa&lt;/A&gt;, where Wikipedia is currently ranked #12.  To a certain degree, this is comparing apples and oranges, as none of the sites that are impeding Wikipedia's growth in this way are reference sites.  Rather, Wikipedia is now going head to head with established search engines and community portals and is steadily gaining ground; as I have stated previously, there is every reason to believe that Wikipedia can and will make search engines as we know them obsolete because of &lt;A href="http://blog.xodp.org/2006/08/keywords-semantics-and-disambiguation.html"&gt;Wikipedia's focus on disambiguating keyword-based queries&lt;/A&gt;.&lt;br /&gt;&lt;br /&gt;Although it's impossible to predict when it will happen, the first practical limitation that Wikipedia will encounter will be in its ability to attract additional contributors.  A related problem that I alluded to above is quality control based on Wikipedia's inherent bias against subject matter experts.  Absent some incredible breakthroughs in artificial intelligence, these limitations will impede Wikipedia's growth long before any inherent limitations in the number of potential Wikipedia topics to be covered.  Meanwhile, based on Wikipedia's success at being the purveyor of general knowledge, I think the &lt;A href="http://wikiindex.org/index.php?title=Welcome"&gt;various expert-based wikis&lt;/A&gt; that are currently emerging will find it easier to attract contributors.</description><link>http://blog.xodp.org/2007/02/what-are-wikipedias-limitations.html</link><author>Internet Esquire</author></item></channel></rss>