Among Good Company

March 25, 2015  
Posted in Access Insights, News, Technology

TEMIS joins the prestigious list of 100 Companies That Matter in Knowledge Management. KMWorld made their annual announcement recently as they recognized the top industry influencers in content, document, and knowledge management. This information came to us from Bio IT World in their article, “TEMIS Named to KMWorld’s 2015 List of “100 Companies that Matter In Knowledge Management”.”

This was the 15th edition of the list that is compiled by KM practitioners, theorists, analysts, vendors, and their customers and colleagues.

Access Innovations was also included in this prestigious list. For a full list of the Top 100 Companies That Matter in Knowledge Management, click here.

Melody K. Smith

Sponsored by Access Innovations, the world leader in thesaurus, ontology, and taxonomy creation and metadata application.

The Promise and Pitfalls of Classifying Food

March 24, 2015  
Posted in Access Insights, Featured, Taxonomy

As a lover of all things tasty, my mind often turns to the kinds of food and drink that I love. As someone who works with taxonomies and thesauri, I tend to try to classify them. Often, though, I don’t get very far with it because I start to get hungry. However, I just ate, so I don’t think it’ll be so much of an issue this time.

Classifying food is at once extraordinarily basic and mind-bogglingly complex, depending on how deep you want to go with it. At its simplest, you have the USDA food pyramid (or plate, depending on when you’re talking about). As a simple guide to make sure your kids are eating the right proportions, the USDA guide can be helpful, if problematic. As a way to look at meaningful relationships between food, though, it’s simply too narrow to be of any real use.

To see a much more complex food classification system, one can simply enter a grocery store.  In there, thousands of food items sit on shelves, organized in a very specific, scientifically driven way. This organization, though, is based on sales maximization, not organizational consistency. That’s how rice ends up in multiple places, with the cheap basic stuff with the other staples and nicely packaged, and pricier styles in International Foods, or somewhere similar. So while there are very good reasons for how the items in a grocery store are arranged, this isn’t the kind of organization that I mean.

I’m thinking of organization based on what food and drink is and how it is viewed by cooks and eaters, not on how to boost sales of the latest in frozen pot pies. However, this can get extremely complicated.

With thousands of various types of food and drink in this world, questions immediately arise that confuse the issue. We all know what bread is, and we all know what cake is. They use nearly all the same ingredients and the result is similar, if very distinct. If we’re building a taxonomy, are they distinct concepts? Is cake a type of bread, or is the fact that one is eaten mostly for dessert, while the other generally isn’t, a big enough difference to keep them separated? How about a box of macaroni and cheese? Obviously, a greater part of what’s in that box is pasta, but in the grocery store, it’s generally nowhere near an actual package of macaroni. Does that little bag of weird cheese powder in the box make it an entirely different product? It seems like a subset to me.

There are problems like this everywhere, which makes attempts at organization seem futile. Where do we even start? In taxonomic terms, the basically useless (for our purposes) food pyramid gives us a few broadest terms to work with. It’s woefully incomplete, but it’s a place to start. Australia did something a little like this with their Australian Health Survey Classification System, which was designed “to group similar foods and report trends in consumption by food category.” While it’s useful and quite interesting from a public health perspective, the near-700 line spreadsheet makes it indecipherable for use by your average eater.

Unless all we want is an organized but flat list of foods and beverages, it seems we must decide on the purpose of the classification, because nothing is going to be one-size-fits-all. There isn’t a comprehensive food taxonomy out there, at least that I know of, but there some really intriguing things that people have done with very specific kinds of classification.

In his book, On Food and Cooking: The Science and Lore of the Kitchen, food scientist and writer Harold McGee features a two-page table (which I cannot post here) that features, on the y-axis, the names of commonly used herbs, and on the x-axis, the chemical compounds that give each its distinct flavor. One look at the table reveals how much is shared between different herbs. Say you’re cooking something and need basil, but are surprisingly all out of it. McGee’s table can show you what other herbs contain the chemical or chemicals that you need to match that flavor. You might get some extra stuff in the dish that you didn’t need, but you will have the flavors that you want.

Then there’s chef Marc Powell, who built a food app that reads menus, turns them into XML documents, and tags them with taxonomy-based metadata for taste, texture, and other food characteristics. This metadata can then be used to do make recommendations for balancing the flavors of a dish, providing a list of ingredients to concoct possible dishes, or any number of possibilities.

I would absolutely love to use that tool; it’s exactly the sort of thing that I want, though for it to work the way I have in my head, I don’t think a simple taxonomy, no matter how large, would be enough, precisely because of the complications that I describe above. On the other hand, an ontology that relates ingredients to associated recipes could be extremely useful. If I could just open my refrigerator or pantry, search in the ontology for the ingredients that I have in there, and have it return possible dishes that use only what I have would change the game for me. With the Internet of Things coming closer and closer to reality for the masses, this doesn’t seem all that far-fetched.

All of this talk about food has given me quite an appetite, but at least I could complete the thought this time.

Daryl Loomis
Access Innovations

Emojis: Communication without Words

March 16, 2015  
Posted in Access Insights, Featured, Technology

Each year since 2000, the Global Language Monitor has selected the Top Words of the Year, which they derive through statistical analysis of word usage. Some may have thought the organization had gone insane this year when they announced the selections for 2014, because at the very top of the list was not a word at all, but the heart emoji. It’s true; a tiny cartoon heart got used so often last year that it supplanted all actual words.

People might shake their heads at that fact and lament what happened to language with kids these days, but it’s how people communicate online, which is basically how people communicate at all anymore, so it’s well worth looking into how and why this has happened.

No matter how one might personally feel about it, there’s no denying the rampant popularity of emojis. They are more commonly used on Twitter than the digit 5, and the single most popular emoji is more commonly used than the tilde. Those facts are crazy to me, and text analytics company Luminoso has compiled even more. Emojis have taken over at lightning speed and there’s no stopping them, so we might as well start trying to find the meaning in them.

As I discussed previously, I have found myself fascinated the last few weeks after I discovered emojitracker.com. By making use of Twitter’s streaming API, it tracks emoji use across the globe in real time. Though it only uses Twitter and not all the other places where the characters are used, the numbers are still mind-blowing.  The most popular character, “face with tears of joy,” has been used more than 632 million times since the site opened and it, along with the others at the top of the list, increases at an extraordinary pace.

It’s not just a flood of numbers, either. You can click on each icon to see a feed of the tweets that the emojis were used on, as well as see those results in JSON markup language; this is the stuff that I find highly interesting. The feeds for the top ranked emojis move far too quickly to understand anything by the naked eye, but there are things to look at in some of the less popular ones.

Take, for instance, the emoji labelled “Pedestrian,” which is simply a man walking. Oddly, of the 16 million times this symbol has been tweeted, nearly every one is in Arabic. Why are they all walking? To see this stuff with the eye, one has to wade through so much material that it would be simply too daunting to actually find larger meaning in any of this.

Computers could easily parse it all out, though. The trouble is that, while it’s interesting to see the data stream, nothing is really being done with it. Despite the fact that it’s already an example of linked data, there is next to no analysis. That site lives in a vacuum, but emoji usage doesn’t. It grows and evolves more rapidly than text language does, and people from different cultures and groups assign their own meanings to single characters and groups.

Yet, in spite of that evolution that, for whatever reason, makes the pedestrian symbol appealing to Arabic speakers, emojis also have somewhat universally defined meanings that make actual communication possible. The Wall Street Journal allows you to translate their headlines into emojis and, though you sometimes have to stretch a bit, it’s pretty easy to see where they’re coming from. Likewise, in a far more absurd example, Herman Melville’s classic Moby Dick has been turned into emoji. Of course, all the deep contextual and literary meaning will be lost in translation, so to speak, but if the words can be communicated in any kind of comprehensible fashion, that’s pretty impressive, if rather pointless.

The problem with all of this from a semantics perspective is that if the meaning does continue to evolve, how could one possibly analyze the data in a meaningful way? Were one to get a comprehensible result today, would they get that same result later? It’s important in semantic analysis to get provable, repeatable results. You can’t see patterns in data when the rules keep changing.

Like it or not, this is the way people communicate today and, whether or not I think emojis are a lasting phenomenon or will be an enduring part of language (I don’t), they don’t seem to be a product of laziness. Instead, they are about speed and clarity of communication. If we can express a complex emotion like love using one symbol rather than many, people are going to gravitate toward it, just like they gravitated toward texting and Twitter rather than tedious old email.

Words and their meanings are always in flux, just very slowly. The difference between our current English and Geoffrey Chaucer’s is massive, but it happened over six centuries. Still, the language is comprehensible without translation. The meaning of emojis may change at a faster pace, but their meanings are still being communicated to people around the world, regardless of language or cultural barriers. To me, that alone is reason enough to want a much deeper understanding of how they’re being used.

Daryl Loomis
Access Innovations

Access Innovations Named in KMWorld’s Annual “100 Companies That Matter in Knowledge Management”

March 9, 2015  
Posted in Access Insights, Featured

Access Innovations, Inc., a leader in digital data organization, is pleased to announce its inclusion on KMWorld’s annual list of the “Top 100 Companies That Matter in Knowledge Management.”

Access Innovations is featured for its fourth year after first debuting on the list in 2009. Other notable companies given a spot on 2015’s top 100 list include Oracle, Google, IBM, and Microsoft.

“The criteria for inclusion on the list vary, but each of those listed have things in common. Each has either helped to create a market, redefine it, enhance or extend it,” said Hugh McKellar, KMWorld Editor-in-Chief. “They all share a fundamental motivation to innovatively meet and anticipate the widely diverse needs of customers with robust solutions to meet evolving customer requirements challenges.”

Marjorie M.K. Hlava, president of Access Innovations, is honored that the company is included on the list. “Access Innovations enjoys  driving technology to face new challenges in knowledge management,” she says. “It’s stimulating and rewarding to be leaders in knowledge management, and it’s delightful to be recognized as a leader in the field. Making content findable for our customers and their users is and always will be our top priority.”

The annual Top 100 Companies That Matter list is compiled by editorial colleagues, analysts, theorists, and practitioners and, unlike many other trade lists, inclusion is not purchased and is at the sole discretion of KMWorld’s editors.

For a full list of the Top 100 Companies That Matter in Knowledge Management, pick up the March issue of KMWorld, available at newsstands now, or visit the following link to view the article online: http://www.kmworld.com/Articles/Editorial/Features/KMWorld-100-COMPANIES-That-Matter-in-Knowledge-Management-102189.aspx

About Access Innovations, Inc. – www.accessinn.com, www.dataharmony.com, www.taxodiary.com

Founded in 1978, Access Innovations has extensive experience with Internet technology applications, master data management, database creation, thesaurus/taxonomy creation, and semantic integration. Access Innovations’ Data Harmony software includes machine aided indexing, thesaurus management, an XML Intranet System (XIS), and metadata extraction for content creation developed to meet production environment needs. Data Harmony is used by publishers, governments, and corporate clients throughout the world.

About KMWorld

The leading information provider serving the knowledge, document, and content management systems market, KMWorld informs more than 45,000 subscribers about the components and processes—and subsequent success stories—that together offer solutions for improving business performance. KMWorld is a publishing unit of Information Today, Inc.

Data Analysis in a Standards-Challenged World

We deal pretty heavily around here in words, what they mean, and how they’re used. It should go without saying, but it’s a fundamental part of what we do and is what makes us so concerned with standards, in both taxonomies and the written word. The two go hand in hand; it’s a whole lot easier to have one when the other is compliant.

Academic publishing, which we deal with the most, and most publishing in general, is pretty good about standards, and so we’re able to easily go in and build a taxonomy or mine the content for data analysis. There has been plenty of talk about how useful that enriched content can be in regard to linked open data, direct consumer advertising, and all that. It’s all well and good, but in places where there aren’t standards, it’s a whole lot more difficult to deal with, at least in a semantic sense.

Only a short time ago, nearly all disseminated content would go through some kind of editing process to make sure that simple things like spelling, grammar, and syntax were correct, but also to be sure that it complied with appropriate standards. Only once that process was complete would the public see anything, at least for the most part. This, of course, made for perfectly readable and understandable content, assuming you were familiar with the language and the jargon.

Then the Internet happened. Now, poorly constructed text is the norm and standards have gone out the window. Much of the Internet is about speed of delivery, and content is given a cursory edit, at best. Compliance with standards and delivery speed rarely make good bedfellows.

I’m not here to argue the rightness of accuracy over speed; the change has happened and there’s no going back. Blogging and especially social media have blown up the old ways. This is just how people communicate today and, to me, this is content that begging for analysis.

But how does one even begin? The one who worries about grammar or syntax in a tweet is a rare beast indeed, and that doesn’t even take into account how things are spelled. Multiple Z’s instead of a single S, fifteen O’s when writing “love,” numbers in place of letters, all sorts of ridiculous things.

Add into that the multitude of languages that people use online, along with widespread disregard for traditional spelling and grammar, and the number explodes. However, because more people from more cultures are communicating with one another, it seems even more important to find a way to be able to control and structure all this data that we have for the same reason that we structure vocabularies for scholarly publishing: quick and easy search.

In theory, that’s what tags on blogs and hashtags on social media are for, but when anybody can come up with their own tags, it’s plain chaos. This anarchy is something that has no place in an information realm that requires at least some degree of standardization. Tags might never be completely standardized, but a system of organizing them into broad concepts may be a solution. There will always be things like #janetiswinning, #imeatingdinner or whatever, so noise is going to be inevitable, but some kind of broader classification could help people find what they’re looking for, given how much new content is produced every hour of every day from every corner of the world.

That noise will always exist, but we can’t dismiss it all as trash. Communicating through social media has become too important a part of our lives to pretend that there’s no value in at least some of the millions of tweets, Facebook posts, Instagrams, Vines, and all the rest. If there’s no value from a scholarly standpoint, there still is from anthropological and political ones, and the power that marketing gains using this kind of data analysis is abundantly clear.

This is conceptually pretty simple when we’re talking about data in the form of text, whether that’s a post or a hashtag. They’re all words, after all, even if they’re spelled tragically wrong. What about things that aren’t words, but still convey concepts? Instagram and Vine are currently two of the fastest growing social media sites and, though they use hashtags, they deliver content visually.

And then there’s the whole new issue of emojis. That might seem like a small thing at first, but they aren’t necessarily used at random, and some are used very specifically. An additional wrinkle with these is that they communicate meaning across languages. It seems to have huge potential for analysis, but is effective analysis even possible given the amount of noise?

I think that the answer is almost certainly yes. This kind of data is too valuable not to mine, especially when the technologies for doing so are already being developed for other purposes. For text, there are developments in sentiment analysis that have already been implemented to analyze social media for political campaigns, and its uses are only going to evolve. Less has been done on this level for imagery, but if a computer algorithm can be built that can accurately identify Jackson Pollock paintings and if a self-driving car can determine spatial proximity and object identification in real time, certainly the potential exists for use in social media. Almost nothing has been done with analyzing emoji use, though there is Emojitracker, which absolutely fascinates me (and I will write about at more length in a later post).

We used to communicate almost exclusively by the written word, but now the technology exists to communicate meaning in a large number of ways. Shouldn’t we analyze and study that meaning? I don’t have answers to the questions, but the more we explore these new realms, it seems like time to start thinking about semantics in a slightly broader way. Standards are important and I’m all for them. But I’m also all for people communicating with each other. The least we can do, as people who work in semantics, is to try to find ways to see meaning in the content of social media, even if it seems like a great bog of nonsense some of the time.

Daryl Loomis
Access Innovations

Making Connections

February 23, 2015  
Posted in Access Insights, Featured

The 11th annual Data Harmony Users Group (DHUG) meeting just wrapped up, and as we reflect on the week-long event, making connections is the unofficial theme that emerged. This makes perfect sense, because the benefit that our attendees mention most often is the networking opportunity – the connections we make with colleagues – during the DHUG meetings.

This year we asked our attendees to fill out a short survey about the meeting. I have included some of their responses in the following list of connections that the DHUG meetings make or explain.

  • We meet and connect with other people who are doing what we do. One attendee said “Networking is an important aspect of the meeting” and “Excellent opportunity to establish and deepen networks for later follow up.”
  • We connect with people who can help overcome challenges that we are facing – because they are facing similar challenges. The most valuable thing on the agenda for one attendee? “Case studies. These are the main reason I come. Love seeing what other folks are doing.”
  • We connect ideas and find answers.
  • We get new ideas for additional things we can do with our data or systems.
  • We connect our users’ viewpoints with the functionality of the software and the services we offer – we explain not only WHAT the software does, but also HOW it works and WHY we designed it that way.
  • We connect authors with editors and peer reviewers.
  • We connect content to other related content.
  • We connect related concepts – not just the words that appear, but the meanings of those words.

Here are a few other comments taken from our survey responses:

“Really enjoyed it – right balance of updates and networking opportunities”

“Really well organized event with delightful people – thank you!”

“I think the range of topics and presentations was good. It’s good to have exposure to the variety of subjects.”

“Hearing from a researcher was great, implementing stories and creative uses also great”

“I honestly like [the meeting] the way it is”

If you missed the DHUG meeting, consider joining us in 2016! We have tentatively scheduled the 12th Annual DHUG meeting for February 22-26, 2016. Marjorie M.K. Hlava’s annual features update will kick off the core part of the meeting on February 23, and will be followed by case studies from our users on February 23 and 24. February 22, 25, and 26 are optional days reserved for hands-on software training and one-on-one meetings.

Watch TaxoDiary for an official announcement and further information.

Heather Kotula, Director of Communications
Access Innovations

Hands-on Learning

February 19, 2015  
Posted in Access Insights, News

The Data Harmony Users Group Meeting continued today with a presentation by Paul G. Kotula of the Materials Characterization department at Sandia National Laboratories. In the presentation, “Six Months of Work in the Lab will Save You Half a Day in the Library or 30 Minutes Online”, he shared his experience as both a consumer and a producer of peer-reviewed, published scientific literature.

Paul is an award-winning author, researcher, and peer reviewer who knows his stuff and knows how our clients use the content that our software enriches. During his presentation, he got specific about how people in his field use information and how researchers use collections.

Today wrapped up the case studies. Over the next two days there will be specific hands-on training, networking, and learning opportunities for the clients. Everyone seems to be eager to get their hands “dirty” and the Access Innovations staff are here and available to answer any questions.

Don’t forget to like our Facebook page to keep up with the latest news and information.

Melody K. Smith

Sponsored by Access Innovations, the world leader in taxonomies, metadata, and semantic enrichment to make your content findable.

The TaxoBook Reviewed

February 18, 2015  
Posted in Access Insights, News, Taxonomy

The Special Libraries Association (SLA) Knowledge Management Division recently reviewed the book, The Taxobook, written by our own Marjorie M.K. Hlava. Their observations and acknowledgments on the content, quality, and focus were in line with what the author intended. “I hope these books will contribute to a better understanding of the different ways taxonomies can be implemented and why information management professionals should embrace them,” said Hlava in the book’s release.

The reviewers were also quick to point out other attributes, i.e., “While each of the three parts comprising this work ends with a glossary, the distinguishing feature of Part 1 has to be the gorgeous, not to be missed illustrations.”

The books are available through Morgan & Claypool Publishers in either online or print format.

Melody K. Smith

Sponsored by Access Innovations, the world leader in taxonomies, metadata, and semantic enrichment to make your content findable.

Win Hansen Named Production Manager at Access Innovations, Inc.

February 16, 2015  
Posted in Access Insights, Featured

Access Innovations, Inc. is pleased to announce another big change to its corporate structure, a move that will streamline workflow and improve the efficiency of Access Innovations’ client projects.

Win Hansen has now been moved into his new role as production manager. Since starting at Access Innovations in 2009, Win has performed myriad tasks for the company and has learned every aspect of the business. This makes him uniquely suited to the wide range of tasks a production manager is required to perform. His versatility, taxonomy building expertise, and people management skills make him the perfect person for the role. Margie Hlava, President of Access Innovations, stated, “Win has been one of our most flexible and versatile employees for some time. He’s willing to get his hands dirty with any project, no matter how strange, with all of his enthusiasm and effort. He is a valuable asset to the Access Innovations family and we are all thrilled with his promotion to Production Manager.”

Win remarks, “I am very excited about this great opportunity. While I know there will be a lot to learn, I am certain that I am more than ready for what’s in store for me and I’m prepared, through this promotion, to help take Access Innovations into the future.”

Win started as a taxonomist at Access Innovations and, later, served as office manager and the company’s graphic designer. He has been involved in projects of all kinds, from taxonomy development to animation and more. Win has led projects for Triumph Learning, the American Society of Civil Engineers (ASCE), JSTOR, the American Institute for Physics (AIP), Harvard Business Publishing, and many more.

Win attended the University of New Mexico, where he earned degrees in history, religion, and art, and he vigorously continues in the classroom to this day. His interests include ceramics, photography, travel, chicken breeding, and beekeeping.

 

About Access Innovations, Inc. – www.accessinn.com, www.dataharmony.com, www.taxodiary.com

Founded in 1978, Access Innovations has extensive experience with Internet technology applications, master data management, database creation, thesaurus/taxonomy creation, and semantic integration. Access Innovations’ Data Harmony software includes machine aided indexing, thesaurus management, an XML Intranet System (XIS), and metadata extraction for content creation developed to meet production environment needs. Data Harmony is used by publishers, governments, and corporate clients throughout the world.

On the Cusp of DHUG

February 16, 2015  
Posted in Access Insights, News, Taxonomy

This week will be the 11th Annual Data Harmony Users Group (DHUG) meeting here in Albuquerque. It’s by far the biggest week of the year for Access Innovations as our clients come from all over the nation to meet and learn from the people who built the software and use it on a daily basis.

This is a “all hands on deck” meeting. Access Innovations staff will be present to answer questions, provide demonstrations, and give customer support for whatever your needs may be, all week long.

“Each year, this meeting provides our members an opportunity to share ideas and address issues and methodologies with colleagues,” said Access Innovations President Marjorie M.K. Hlava. “We enjoy talking with our clients and finding out what items are on their wish lists for future software developments, and the new releases reflect those requests.”

Attendees will learn how, why, and when taxonomies are used; how to start and maintain a taxonomy; and what resources are available for taxonomy development.

There will be daily reporting here on TaxoDiary, highlighting various presentations and speakers for those who can’t take notes fast enough or weren’t able to attend. One topic that is always a thread throughout presentations is subject matter experts (SMEs). These knowledgeable people are often used in the preparation of thesauri and taxonomies by providing key information and perspective.

If you have never attended DHUG before, take a moment to read Daryl Loomis’ article about what he is expecting as a first-timer.

See you there!

Melody K. Smith

Sponsored by Data Harmony, a unit of Access Innovations, the world leader in indexing and making content findable.

Next Page »