gutheim's blog

202ish Knowledge Need @ Google

This is interesting for all who fell in love with 202 topics: Google is searching for an intern who can do exactly what we have been learning and practicing for the last 4 months...

http://www.google.com/jobs/students/us/internships/eng/librarian-internship-cambridge/index.html

An outlook for our Wine tasting: Familiarize with Flavours - Get to know the Facets and Categories

In prospect of our wine tasting next week:

I'd like you to point you to a very 202ish case of applications of facets: "The Wine Wheel" (www.tinyurl.com/wine202a)

Also, it is interesting to combine the classic categories of wines (red wines, white wines, and Merlot etc.) with the facet flavours. (www.tinyurl.com/wine202b)

How Caesar encrypted his messages...

Having talked about how the Enigma encryption algorithm and the analysis of language, it is interesting to note that Caesar already used a character shift encryption method to secure his messages from other unauthorized people. As a response, his enemies started to analyze language and figured out the frequency of characters in the language (as Bob has mentioned several times). In that sense, the analysis of language already goes back to Caesar... http://en.wikipedia.org/wiki/Caesar_cipher

The German says: "Rindfleischetikettierungsüberwachungsaufgabenübertragungsgesetz"

A nice list of the longest German words. However, it is important to note that German is not as awkward as it seems to be. This is clearly an artifact of German bureaucracy.

http://german.about.com/library/blwort_long.htm

Btw: Rindfleischetikettierungsüberwachungsaufgabenübertragungsgesetz means "beef labeling regulation & delegation of supervision law"

Ask "Number of people in turkey" and get a smart answer vs "How to Figure Out What Size Turkey to Buy" results

When you type in phrases that are more natural to you but usually less processable by computers, search engines most of the times provide bad results. Google for instance gives you as the first result a website with the title "How to Figure Out What Size Turkey to Buy". To be fair, Google started to provide information such as population at the top of the search results when typing in more feasible questions. However, there is another search engine which specialized on quanitative data which is provided when querying actual questions. E.g.

Have a lot of tasks computers can't do? - Get an "Online Virtual Assistant"

There are still tasks out there which you can't easily do with a computer. They might be complex, ambiguous and/or adhoc. Why don't you get a online virtual assistant for that? S/he helps you with data analysis, research, outbound calling, blog management etc.

Check it out: www.asksunday.com/

Standards as Network Goods - The notion of the tipping point

As described in the lecture, standards are important vehicles for companies to win in a competitive market and to "take it all". Standards are network goods: The perceived utility of a user increases with the number of users to also use the same standard on their devices, software etc. Take Windows, a DVD or the Internet Protocol. This is because standard compliance allows interoperability, higher performance and increased reliability as well it enables new features and services.

Implications of Controlled Vocabularies - Opportunities and Limits

I'd like to take natural languages as an example for an interesting question I'm thinking about and use this to analyze implications for Controlled Vocabularies.

This post partially refers to a previous blog post (courses.ischool.berkeley.edu/i202/f10/blog/cultural-categorization) where the topic of cultural categorization using Russian imagery expressions as an example has been analyzed.

From women dominated text-processing departments to claud-based global marketing collaboration platforms

The metaphor of a women dominated text-processing department points out that there has been a centralized place several decades ago to process information that are meant for internal or external stakeholder - as described by Bob. This centralization - may it be intestinally or due to technological restrictions - enabled companies to enforce company-wide Corporate Communication policies and even Corporate Identities easier as nowadays.

Wikipedia Thesaurus Ontology Service

A service that extracts associations and their relationships to entires in Wikipedia, i.e. crawling a wikipedia side an generating tags based on the content. The Platfrom works like a search engine wher users type in a word and an ontology that distinguishes between people, places, labels etc. is displayed.

dev.sigwp.org/WikipediaThesaurusV3/

Syndicate content