Access Insights

Interdisciplinary Taxonomies

By |July 4th, 2011|Access Insights, Featured, Taxonomy|Comments Off on Interdisciplinary Taxonomies

People need to decide when they are building their taxonomy if they have one point of origin or if they are interdisciplinary. Some firms start with a one-point path; it is that path and that path alone that drives the firm. They have a single point of origin. They will need to clearly state term relationships, which is not always done.

Uses for a Thesaurus

By |June 27th, 2011|Access Insights, Featured, indexing, Taxonomy|Comments Off on Uses for a Thesaurus

A thesaurus can be used for several different purposes. It is mainly used for information retrieval, in one way or another. We use it in information retrieval both on the search end and in the tagging of the records.

From Simple to Complex

By |June 20th, 2011|Access Insights, Featured, ontology, Standards, Taxonomy, Term lists|Comments Off on From Simple to Complex

People talk about different kinds of vocabularies. The differences usually have to do with the structure, or lack thereof. Sometimes, people refer to “flat lists”. These are one-level lists with no hierarchical structure. They can be uncontrolled or controlled lists. An uncontrolled list is a simple, flat structure. The uncontrolled list is your “Saturday list”.

Pre or Post-Coordinate Indexing?

Most people think about what they want to search for and are willing to combine their concepts at the time of search. If you think of the way people search in Google, they put in the combination of terms they are thinking of; they are doing the coordination of terms. It is up to the search software to do the intersection of the terms for them and figure out the post-coordination.

Search as Big Brother, Molding What You See and Think

A recent TED presentation is by Eli Pariser. He is the author of “The Filter Bubble: What the Internet Is Hiding From You.” A new and very interesting book. His talk is a synopsis of how the Google personalization algorithms effect search results. Google results are influenced by your own search history and other online activity. Any system such as Amazon, Yahoo, Bing ebay shopping systems depend heavily on personalization to serve you results. Traditional databases do not use profiles (yet) but they are often based on Verity, Vivisimo, Autonomy, Fast and other mathematically based search software so they could and they do serve up different results whenever the vectors are reset - that is every time additional data is added to the system with updates or metadata enrichment.

Adventures of a TaxoTourist

By |May 23rd, 2011|Access Insights, Featured, Taxonomy, Term lists|Comments Off on Adventures of a TaxoTourist

The trip was awesome—a dream exotic vacation to Bali. It was not about eat, pray, love, but a rather unbalanced midpoint to meet my Oz-dwelling daughter. I enjoyed dashes of ecotourism and agritourism, but even in full vacation mode I couldn’t fully suppress my perspective as a taxonomist.

The Semantic Web Goes Mainstream

By |May 16th, 2011|Access Insights, Featured, semantic|Comments Off on The Semantic Web Goes Mainstream

The recent MarkLogic User Conference was a watershed event for the publishers in attendance, many of whom are just beginning to strategize about the application of semantic technology to their content. After years of hearing “the Semantic Web is coming,” the message this time was that it’s no longer about “what” or “why,” but “how” publishers will leverage this technology. It has been 10 years since Tim Berners-Lee, Jim Hendler, and Ora Lassila announced the creation of the Semantic Web, so many of us were very excited to hear Jim Hendler’s update on current developments. Some key themes of his presentation were already covered in this article from August, 2010 in New Scientist: Google, Twitter, and Facebook Build the Semantic Web. With his trademark slogan, “A little semantics goes a long way,” Hendler added some further context, and described how these companies and others have tapped into social and commercial drivers to promote relatively simple approaches to solving the problem of getting content tagged, and thus increasing the ability for computers to understand the meaning of text across vast amounts of Web content.

Not True!

By |May 9th, 2011|Access Insights, Featured, Folksonomy, search, Taxonomy|Comments Off on Not True!

The Autonomy folks must be getting worried about the progress of taxonomy applications and the precision and recall that such systems provide. Autonomy and Google live on relevance rankings as the return to the user. Relevance to me is a confidence game. It is the best guess of the system as to whether the results returned will actually match the user's request. If you have a big enough data set returned, certainly something in there will be useful. But the sheer amount of items the user has to review (or amount of noise they have to look at) is very annoying. So they rank the returns by relevance based on a number of statistical factors so the most likely items based on co-occurrence with terms matches and near matches will appear at the top of the list - that is, they will be relevance ranked.

Where Are They Now?

By |May 2nd, 2011|Access Insights, Business strategy, Featured, Standards, Technology|Comments Off on Where Are They Now?

Tech companies come and go. There are always tech savvy entrepreneurs with big ideas looking to fill a need in the market and investors looking to get the huge returns that only come from investing in tech startups.

Name Disambiguation Musings

The Wall Street Journal on April 19, 2011 talked about the need for customer name authority control in banks. Okay, so maybe that is not what they said. What they did outline was the problem Arabic names and the many ways to state them gives to banks and other organizations which try to track the information or put a hold on funds for organizations like the example, Moammar Gadhafi. Also know as many other names. His first name could be transliterated as Muammar, Mummar, Mohamed Mahmut, Mehmud and more than 20 other variants. The same goes last name could be Gaddafi, Ghathafi, Elkaddafi, El-Kaddafi, Al-Gaddafi, Gadhafi, Qaddafi, Al-Qadhafi, El-Qaddfi, Qadhafi, Abu Miryar Al-Qahafi, Ghadaffi, and others. Any combination of these names is valid. There are further complications of the Abu or Al or El and other designations of honor make things even more interesting.