Archive

Archive for the ‘Bookshelf’ Category

Metadata and the World of Tomorrow: David Siegel’s Pull

March 6th, 2010 No comments

Book Review:  PullCover2
Pull: The Power of the Semantic Web to Transform Your Business

David Siegel
Portfolio Hardcover, 2010
ISBN-10: 1591842778, ISBM-13: 978-1591842774


As one of those people still bitter that personal jet-packs have yet to materialize, I am always a bit skeptical of breathless depictions of the future.  David Siegel’s new book, Pull: The Power of the Semantic Web to Transform Your Business definitely falls into this category, but in this case his predictions are likely to come true.  In fact, as he repeatedly asserts, many of them already have.  Ironically, much of my cynicism about rosy-eyed predictions of the World of Tomorrow is rooted in the promise of the Semantic Web as first articulated by Tim Berners-Lee, James Hendler and Ora Lassila in a 2001 article in Scientific American.  In that article Lee and his W3C compatriots painted the future of the web as “an environment where software agents roaming from page to page can readily carry out sophisticated tasks for users.”  This vision has never really materialized, but to a large degree  the semantic web has.  It is just emerging in a different form and with a different emphasis.  Rather than focusing on autonomous software-agents just shy of becoming self-aware, the semantic web is all about linked-data.

 

Linked Data is about using the Web to connect related data that wasn’t previously linked, or using the Web to lower the barriers to linking data currently linked using other methods. More specifically, Wikipedia defines Linked Data as "a term used to describe a recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web using URIs and RDF."

–linkeddata.org

 

This is the manifestation of the Semantic Web Siegel is so excited about.   He takes a broad definition of the Semantic Web focusing mainly on the importance of shared metadata (ontologies aren’t discussed in any depth until chapter 12) and illustrates with business applications, both current and potential. Each chapter enthuses about the possibilities inherent in ubiquitous, structured and meaningfully tagged information.  His prose is often a bit too evangelical for my taste. You often come across sentences like, “As you learn about new game changing technologies, keep looking for the management mind shift you’ll need to go bravely into the world of pull.” And his scenarios often have a gee-whiz flavor similar to the agent-oriented predictions of Berners-Lee’s decade old vision.

 

In the semantic future, information lives online, waiting to be pulled through your device-mesh.  If you’re watching a movie at home and have to go to the airport, you’ll log into the display in the waiting lounge or on the airplane, and the movie will automatically continue where you left off. Your music or favorite news station will start playing as soon as you get into your rental car, you’ve logged into your personal data locker and now the rental car (including mirror and seat adjustments) is tuned to your personal ontology.  In fact, all your preferences will live online, and your preferences will replace many of the products you own today.

 

The notion of a “personal data locker” is a good example of how Siegel’s exuberance is actually pretty well grounded.  Though he doesn’t discuss it explicitly, Personal Health Records such as Microsoft’s HealthVault and Google Health are already in place with much of the functionality Siegel predicts.  (see my post from last year on Personal Health Records) In some cases, Siegel gets things wrong, as with his discussion of the Health Insurance Privacy and Accountability Act (HIPAA) but overall he does an excellent job of presenting the current state and direction of these technologies.  Just as important, he connects them to concrete business problems without jargon that may intimidate (or bore) non-technical readers. 

Pull: The Power of the Semantic Web to Transform Your Business is what I usually classify as an airplane read; a nice business-related title that can be read easily on a flight from Portland to New York.  It is written at the level of a Discovery Channel documentary and is quite engaging.  Imagine someone at a cocktail party, (after maybe one glass of wine more than is strictly prudent)  explaining to an intelligent but not technologically savvy guest what the semantic web is and why it matters.  This book is not going to give you the depth of information available in other titles such as Allemang and Hendler’s excellent Semantic Web for the Working Ontologist, but it is not intended to.  The vast majority of business readers do not need, nor want, to know the nitty-gritty details of OWL, RDFS and SPARQL.  They do need to know how semantic technologies as a whole are shaping the web and business in an information intensive world.  In that context, Siegel’s book succeeds admirably.

Bringing the Goundswell Behind the Firewall

January 27th, 2010 No comments

Enterprise 2.0

Book Review:
Enterprise 2.0: New Collaborative Tools for Your Organization’s Toughest Challenges

Andrew McAfee
Harvard Business School Press, 2009
ISBN-10: 1422125874, ISBN-13: 978-1422125878


In 2003 publisher Tim O’Reilly began using the term “Web 2.0” to describe the emergence of the “participatory web.”  Blogs, Wikis, reviews and recommendations were turning passive consumers into active contributors.  O’Reilly sponsored a conference around the concept and a buzzword was born.  For the past couple of years, MIT’s Andrew McAfee has been championing the use of these tools behind the firewall as a means to foster innovation and efficiency within organizations.  McAfee has summarized his research into the participatory enterprise in his new book Enterprise 2.0: New Collaborative Tools for Your Organization’s Toughest Challenges.

Enterprise 2.0 isn’t yet a full-fledge buzzword, but it’s getting there.  Enterprise social computing platforms are all the rage (there’s even an Enterprise 2.0 Conference) and vendors like SocialText, Jive and even Microsoft have rushed new platforms to market that promise to usher in a new age of openness and collaboration for even the stodgiest of corporations. A lot of organizations are drinking the Kool-Aid and rolling out these tools without much thought as to what they hope to accomplish, much less how Web 2.0 will help. This was the fatal flaw of the knowledge management frenzy of the 90s.  Too much emphasis was placed on tools and technology and too little on the people they were supposed to benefit.  This point is not lost on McAfee.  It is in fact the central premise of his outstanding book.

 

Enterprise 2.0 is not primarily a technological phenomenon.  … The appearance of these novel tools is a necessary but not sufficient condition for allowing new modes of interaction, collaboration, and innovation. … the mechanisms of emergence are organizational and managerial, rather than purely technical.  In other words, leaders can’t simply assume that healthy communities will self-organize and act in a coherent and productive manner after Web 2.0 tools are deployed.

 

Throughout the book, McAfee uses case studies of organizations such as  Google, Serena Software, VistaPrint and the US Intelligence Community, to demonstrate the central concepts of Enterprise 2.0, their benefits and pitfalls and how best to approach their adoption. He defines Enterprise 2.0 as “the use of emergent social software platforms by organizations in pursuit of their goals.” These Emergent Social Software Platforms (ESSP) encompass all of the tools making up Web 2.0, but the concept goes beyond a roster of applications and services.

 

Social software enables people to rendezvous, connect or collaborate through computer mediated communication and to form online communities.  Platforms…are digital environments in which contributions and interactions are globally visible and persistent over time.   Emergent means that the software is freeform and contains mechanisms like links and tags to let the patterns and structure inherent in people’s interactions become visible over time.

 

Since any new paradigm is only as good as the number of catchy acronyms it fosters, McAfee organizes the six most common technical features of these tools under the sobriquet SLATES (Search, Links, Authoring, Tagging, Extensions, Signals). The first four of these features are by now familiar to most people on the web, but the terms Extensions and Signals may not resonate immediately, even though we likely use them daily.  Extensions are the semi-automated classifications and recommendations that drive the “you may also like” features of  Amazon.com, Netflix and StumbleUpon.com.  (ie.  “If you liked this, then by extension you would like this.”)  Signals are simply notifications of new content such as RSS feeds and SharePoint alerts.  While this list of characteristics provides a useful framework for the discussion of Enterprise 2.0  the real value of McAfee’s work is his analysis of how they can be used to achieve an organization’s goals.

Central to McAfee’s conception of ESSPs and the connected enterprise is the concept of tie strength as first articulated in Mark Granovetter’s 1973 study “The Strength of Weak Ties”  (SWT).  I discussed SWT briefly in an earlier post, but in a nutshell tie strength refers to the Bullseye different levels of relationships we maintain in both our personal and professional lives. These range from close friends (strong ties) to casual acquaintances (weak ties).  McAfee also accounts for those people who would be valuable associates if only we knew about them.  Taken together these various types of interpersonal relationships form the “Enterprise 2.0 Bull’s-Eye”.

Most enterprises attempting to promote collaboration focus on teams, usually through providing new tools or sponsoring rope-climbing retreats.  The idea has always been to build tight-knit groups; to reinforce strong ties.  McAfee argues that while these relationships are critical and should be supported, say through the adoption of collaborative authoring environments, weak ties should not be neglected.  The problem is that everyone in a tight-knit group already knows everyone else.  It is far less likely that all my close friends or teammates (strong ties) have relationships with my casual acquaintances (weak ties) the people on the outer-edge of my social network.  Even though my association with these folk of the fringe is tenuous, those relationships provides a link between my core group and others that might not otherwise be available. “Strong ties are unlikely to  be bridges between networks,” McAffee says. “Weak ties are good bridges”  He goes on to cite Granovetter.

 

The weak tie between [a person] and his acquaintance, therefore, becomes not merely a trivial acquaintance tie but rather a crucial bridge between the two densely knit clumps of close friends…these clumps would not, in fact be connected to one another at all were it not for the existence of weak ties. …  social systems lacking in weak ties will be fragmented and incoherent.  New ideas will spread slowly, scientific endeavors will be handicapped, and subgroups separated by … geography or other characteristics will have difficulty reaching a modus vivendi.

 

McAfee also draws on the work of anthropologist Robin Dunbar to bolster his claims about the corporate value of social computing.  According to Dunbar the theoretical maximum social group size for humans is between 100 and 230 people, probably settling at around 150.

 

The figure of 150 seems to represent the maximum number of individuals with whom we can have a genuinely social relationship, the kind of relationship that goes with knowing who they are and how they relate to us.  Putting it another way, it’s the number of people you would not feel embarrassed about joining uninvited for a drink if you happened to bump into them in a bar.

 

By reducing the time and effort it takes to keep track of other people’s activities, both personal and professional, social computing platforms have the potential to dramatically increase the number of people with whom you can have a social relationship, in essence increasing your memory capacity for potentially valuable contacts. Organizations which are too focused on strong ties fail to see the value of these “low-density” networks.  As a result, the platforms that nurture and sustain them, such as Facebook, LinkedIn and Twitter, are viewed with suspicion and are generally considered an unprofitable time-suck for their staff.  McAfee’s case studies give some excellent counter examples, such as Serena Software’s decision to move their entire corporate Intranet to Facebook and bring in the staff’s teenage children to train employees on “Facebook Fridays.”

Organizations often attempt to tap into the third ring of McAfee’s bull’s-eye by creating elaborate, centralized expertise directories.  My experience with such directories has always been that they are extremely difficult to initially populate and even harder to maintain.  As a result, their utility drops off quickly after launch, unless people can be induced to take ownership of their own profiles and keep them up to date on their own initiative.  That is not to say they are not a worthwhile endeavor, but McAfee highlights some interesting alternatives (or at least supplements) that may already be in place, such as document repositories, but that are underutilized.  McAfee demonstrates the successful conversion of potential ties into productive collaborations in both the Google and US Intelligent Community case studies and elaborates on those efforts throughout the book.

The introduction of many of these tools may seem to be old-hat to many readers, but it is a necessary level-setting for the broad readership to which the book seems targeted.  McAfee’s tool overviews are effective and engaging.  They provide all the orientation necessary for newcomers while still treating veterans to fresh insights.  This is important as the real value of the book is found in Part II: Succeeding With Enterprise 2.0. In these final three chapters, McAfee addresses the standard objections to adopting Web 2.0 tools (Won’t staff just use it to gossip? What if somebody says something bad about our company?  Doesn’t this expose us legally?) A long list of objections is presented and while the risks can’t necessarily eliminated, they can generally be adequately mitigated.  On balance, the benefits more than counterbalance the cost. McAffee sums up his advice by saying, “For most organizations, in fact, I believe that these benefits outweigh all the risks.”

Chapter seven lays out a roadmap for succeeding with Enterprise 2.0.  McAfee is quick to point out that successfully adopting these technologies and techniques “can’t be reduced to a single step-by-step recipe.”  Rather he offers recommended steps and a few warnings.  First and foremost is managing your own expectations of how quickly or fully people will embrace a new way of working even if it does make their lives easier.  A new tool or process needs to provide at least a ten-fold improvement over the status-quo in order to be accepted. McAfee cites the research of Harvard Marketing professor John Gourville to support this assertion.

 

Many products fail because of a universal, but largely ignored, psychological bias: People irrationally overvalue benefits they currently possess relative to those they don’t. … [This] leads to a clash in perspectives: Executives, who irrationally overvalue their innovations, must predict the buying behavior of consumers, who irrationally overvalue existing alternatives.  The results are often disastrous: Consumers reject new products that would make them better off, while executives are at a loss to anticipate failure.  This double-edged sword is the curse of innovation.

 

McAfee advises a go-slow approach with a “long haul view.” Determining what you are trying to accomplish in advance of deploying tools is critical, as is reconciling yourself to the fact that the benefits will likely take quite some time to fully realize.  Even when those benefits do materialize they may difficult to quantify.  McAfee advises against attempting to calculate a Return On Investment for Enterprise 2.0 initiatives.  It is much more important to measure progress.  As he says “these [initiatives] require sustained management attention, not just a periodic contest among business cases in which the highest ROI figure wins.”  As I prepare my budget and capital funding requests for the coming year, I will be be passing this book along to both my superiors and subordinates with this passage underlined.


Purchase Enterprise 2.0: New Collaborative Tools for Your Organization’s Toughest Challenges at Amazon.

Book Review: Scientific Collaboration on the Internet

May 3rd, 2009 No comments

Scientific Collaboration on the Internet

Gary M. Olson, Ann Zimmerman & Nathan Bos (Eds.)
The MIT Press, 2008
ISBN-10: 0262151200, ISBN-13: 978-0262151207

 

Science is rarely, if ever, a solo effort and as research projects become increasingly interdisciplinary, the talent required for a given effort is unlikely to be found in at a single physical location. It would seem that with the ubiquity of Information and Communication Technologies (ICT) that this would be a non-issue.  Video conferencing, extranets and email should make remote collaboration simple.  As anyone who has attempted to schedule meetings across time-zones or share files across firewalls knows, this is not the case.  The difficulty of creating and maintaining effective collaborations across geographically dispersed researchers despite the proliferation of collaboration tools, is a recurring theme running through all twenty chapters of Scientific Collaboration on the Internet the latest installment of the Acting with Technology series from MIT Press.

The book has as its genesis the Science of Collaboratories (SOC) project.  The SOC is an NSF funded initiative “devoted to understanding the technical and behavioral principles that can lead to better, more successful design of collaboratories in the future.”  So what exactly is a “collaboratory?”

The term was coined in 1989 by then assistant director of the National Science Foundation, William Wulf, who contributes the forward to Scientific Collaboration on the Internet.

 

A “collaboratory” is a "center without walls" in which the nation’s researchers can perform their research without regard to geographical location—interacting with colleagues, accessing instrumentation, sharing data and computational resources, and accessing information in digital libraries."

 

This definition was refined a couple of years later in a  a 2001 workshop of the SOC project:

 

A collaboratory is an organizational entity that spans distance, supports rich and recurring human interaction oriented to a common research area, and provides access to data sources, artifacts, and tools required to accomplish research tasks.

 

Efforts at establishing technical infrastructures to facilitate collaboration among geographically distributed researchers and facilities have tended to be more-or-less one off efforts tailored to the situation at hand and resources available.The Science of Collaboratories (SOC) project has been striving to change the ad hoc nature of collaboration infrastructure creation and to establish the creation, maintenance and study of these meta-laboratories as a discipline unto itself.

 

The project’s goal was to "define, abstract, and codify the underlying technical and social mechanisms that lead to successful collaboratories." The project members engaged in three primary activities:

  • Qualitatively and quantitatively study collaboratory design and usage
  • Abstract and codify principles, heuristics, and frameworks to guide the rapid creation and deployment of collaboratories
  • Create and maintain a collaboratory knowledge base that designers of future collaboratories can consult

This book is more or less a status report on that mission.  The first two sections, “The Contemporary Collaboratory Vision” and “Perspectives on Distributed, Collaborative Science”  frame the discussion and begin to establish a vocabulary for the conversation.  Fundamental concepts such as e-science, cyberscience and cyberinfrastructure are discussed in detail in the first few chapters and definitions are proposed, but cannot be said to be definitive.  Once the groundwork is established the pertinent concepts are fleshed out into a taxonomy of collaboratories and rolled into a nascent “Theory of Remote Collaboration.”  The tone of these chapters is very academic and not a particularly fun read but they do provide a structure for evaluating the various projects examined in the remainder of the book.

Fifteen diverse collaboration projects are each given their own chapter which are grouped into four categories: Physical Sciences, Biological and Health Sciences, Earth and Environmental Sciences, and The Developing World. The editors pose questions in four areas to each of the collaborative efforts chronicled in this volume.

 

Successes: What success stories are related to the collaboratory?  What has been accomplished in terms of science, technology and improving the human infrastructure?

Failures and challenges: What problems have been encountered?  Which have been overcome and which have not?

The role of Technology:  How were new, or not-so-new, collaboration technologies used in the project? Which technologies were important and which did not perform as anticipated?  What is needed for the future?

Management Practices:  What new management practices were needed to enable long-distance collaborative science?

 

    It may be tempting to focus only on the section pertaining to ones own field.  (I automatically flipped directly to the Biological and Health Sciences) but to do so would be to miss the true value of this volume.  While some of the scenarios may be drawn directly from our day-to-day professional experience, seeing the same fundamental issue in the context of a different discipline can be enlightening.  This is particularly true in the section on The Developing World in which scientists from the industrialized world must work with scientists in the third world who face challenges unheard of in the well-funded west.

    Defining the parameters of this discipline, especially identifying best practices in and IT enabled global science community, is particularly timely.  Enhancing cyberinfrastructure is explicitly called out in the recent ARRA C06 and G20 RFPs as a desirable area of investment.  The National Science Foundation has published a roadmap entitled  “CyberInfrastructure Vision for the 21st Century” which lays out the NSF vision for exploiting a wired world.

    Of course, despite the efforts of Olson, Zimmerman and Bos, their contributors and the greater Science of Collaboration project, the  none of the questions surrounding remote scientific collaboration are answered definitively.  Rather they serve to frame the various reports and highlight common themes.  The main theme that emerges is that facilitating collaboration is not at its heart a technology problem.  In a world of increasing competition for scarce research resources, getting scientists to play nicely together is the perennial challenge of fostering collaboration. As each of the case studies demonstrates, technological barries can generally be overcome, but social and professional hurdles persist.  As the editors note "even when advanced technologies are available, distance still matters."