Science Addiction

A dormant blog by Devanshu Mehta

Tag: Open Source

On Community-Based Collaboration: Lesson From the OCLC Debacle

Community-based collaboration or “Crowdsourcing” has become the buzzword in many industries- the idea that by fostering a community, you can solve many major problems through their collective wisdom without actually hiring people with… wisdom. Linux, Wikipedia, the recent Twitter Vote Report and many other projects are often cited as successful examples of this.

The nonprofit OCLC has a membership of over 69,000 libraries around the world. These libraries collaborate to create a database– WorldCat— of bibliographies that all the member libraries can use. It is a great system– or at least it was, until the recent introduction of their upcoming use policy. The two major concerns- via Terry’s Worklog– were:

  1. OCLC would require the license to be placed within the record. This takes the ownership of records away from the library and since it is only a link to the license, the license could be changed at any time without the knowledge of the linking library.
  2. WordCat data could not be used for creation of services– even non-profit– that may compete with it.

The first concern has been largely alleviated in a recent version of the OCLC FAQ, but the second one remains. Who really owns the database? Since it only applies to libraries who are members of OCLC (in contract), what prevents someone else from creating a competing service? And finally, can you really copyright a database?

There are many projects out there, like OpenLibrary, that are trying to create a truly open, non-commercial database of books that would run afoul of this clause. In reality, the problem is not in whether it will be enforced but in that this organization believes it is more than the sum of its parts. That OCLC– not its members– controls how and where the data should be used- data that was created by its members.

This is where OCLC is different from free and open source projects like Linux, Wikipedia and every Creative Commons or GPL licensed copyrighted work. There is no right to fork.

To everyone who contributes to community projects:

Always reserve the right to fork.

That is to say, you should always be able to take the marbles and go home. To fork, in open source projects, means to take all the code/data and create another project. This is made possible by the inherent “free”ness of GPL, CC, GFDL and other licenses. Many open source projects have been forked in the past because a sufficient chunk of the community didn’t like the rules they were being asked to comply with. Nobody controlled the code, so everyone controlled the code.

However, in the case of the OCLC debacle, via Annoyed Librarian:

To use a prison metaphor, it’s clear that librarians dropped the soap decades ago.

Or Stefano’s Linotype:

Basically, by using OCLC’s data you agree to protect their existence. And their monopoly (nobody else in the world does what they do, at the scale they do it). And with data that they didn’t even create.

In a time when everyone is using search engines as their first stop in finding answers, closing WorldCat further is a major step backwards. Like many other old-world companies, the OCLC is trying to remain relevant in the face of major paradigm shifts- in this regard, it is much like the Associated Press, which is losing relevance and support from member libraries (thanks Edward Vielmetti). If this was a commercial enterprise built by a million highly paid employees, it would make no difference what they did with their data. But this is a non-profit built on the backs of its members contributions.

As Princess Leia said:

The more you tighten your grip, Tarkin, the more star systems will slip through your fingers.

Resources:

The Year Flash and Real Player Arrived

In a sense, it’s the year Linux arrived. The year Flash and Real Player took Linux seriously. Flash Player 10 was released for Linux the same day as other platforms and earlier in the year Real Player 11 was released as a .deb for Linux. [via Slashdot]

Track Your Stolen Laptop for Free

Researchers at the University of Washington and University of California, San Diego have released a free and open source software called Adeona. It tracks your stolen or lost laptop without relying on proprietary or centralized software or databases. And unlike commercial services, it preserves the privacy of the user- it uses cryptography mechanisms so that only the user has access to the laptop location information.

Adeona is designed to use the Open Source OpenDHT distributed storage service to store location updates sent by a small software client installed on an owner’s laptop. The client continually monitors the current location of the laptop, gathering information (such as IP addresses and local network topology) that can be used to identify its current location. The client then uses strong cryptographic mechanisms to not only encrypt the location data, but also ensure that the ciphertexts stored within OpenDHT are anonymous and unlinkable. At the same time, it is easy for an owner to retrieve location information.

It is licensed under GPLv2, and is available for Linux, Mac OS X and Windows. Like Lojack for cars, simply the prevalence of such software can serve as a deterrent for casual theft. A determined thief can replace the operating system before using (or selling) it, but a lot of consumer electronics theft is casual and opportunistic.

Big Book of Apple Hacks

applehacks.jpg

A long time ago, Chris Seibold, one of my co-writers at Apple Matters, asked if I had any ideas for a book he was writing for O’Reilly publishers called “The Big Book of Apple Hacks“. That email turned in to a little brainstorming which led to five chapters that I have in the book.

  • MacFuse
  • SSH Tunnels
  • ImageMagick
  • Tivo + Mac
  • Fink & MacPorts

Of course, those chapters have much better names in the book. Unfortunately, a publishing error left my name off the acknowledgments at the end of the book (seriously!), but my name is at the end of each chapter I wrote. Hopefully the book will have many reprints in the future which will include my name!

10 Years of The Cathedral and The Bazaar

In May 2007, that seminal work by “Eric S. Raymond”:http://www.catb.org/~esr turned ten years old. _The Cathedral and the Bazaar_ is a book about the simple notion that in software development _given enough eyeballs, all bugs are shallow_. Six years after Linux came on to the scene and 14 years after Richard Stallman gave birth to the GNU project, Eric Raymond put an intangible, untested concept in to words and has arguably had a phenomenal impact on software and geek culture.

When I wrote my “95 Theses of Geek Activism”:http://www.scienceaddiction.com/2006/07/23/95-theses-of-geek-activism/ last year, I put in CatB as a required reading as thesis #12 (the order meant nothing!). It could well have been #1, because it was the book that, for me, transformed the open source model from a touchy-feely philosophy to a practical, viable and achievable ideal for software development.

When Richard Stallman introduced the GNU project, it was a philosophy. You stuck with the GNU model because you believed in truth, liberty, freedom and justice. The BSD and other licenses were less philosophically rigid and have hence been taken advantage of by companies. Apple based their operating system OS X on BSD but were not obligated to share their improvements with the BSD community. They could take, but did not have to share. The GPL aimed at changing that- sharing was a many way street.

Linux brought the truly bazaar-style development in to the (geek) mainstream- where every user was a developer and the code was _released early and released often_. These facets of Linux development were part accidental, part consequences of the GPL and part Linus’ genius. Of course, Raymond was the first to test and formally describe the theories behind the success of Linux and how to apply them to future projects. Raymond tested the bazaar philosophy on his own _fetchmail_ project and the book tracks his success with it.

* *CatB as a Manifesto*: This book changed the geek language. Phrases such as the one above about eyeballs and bugs or the fundamental ideas about how to treat your beta testers are now treated as obvious. Indeed, even Yahoo and Google use the idea of treating their users as insider beta testers for many of their products.
* *CatB and O’Reilly*: _The Cathedral and the Bazaar_ was the first book published in print (by O’Reilly) with an open source document license. This allowed the book to be copied and modified as long as the resulting work had the same license- a precursor to “Creative Commons”:http://www.creativecommons.org licenses.
* *The Open Sourcing of Netscape*: The open sourcing of the Netscape browser and the start of the Mozilla project at the end of the browser wars in the late 90s is largely attributed to this book. At the time, CTO of Netscape, Eric Nahn told Raymond, “On behalf of everyone at Netscape, I want to thank you for helping us get to this point in the first place. Your thinking and writings were fundamental inspirations to our decision.”

Eric Raymond first presented _The Cathedral and the Bazaar_ at the “Linux Kongress on May 22nd, 1997”:http://www.linux-kongress.org/1997/ in Würzburg, Germany. Ten years later, Linux is more powerful than ever, Ubuntu is ready for the desktop (says me) and the bazaar model is alive and thriving.

* Read “The Cathedral and the Bazaar”:http://www.catb.org/~esr/writings/cathedral-bazaar/cathedral-bazaar/
* About the “Netscape decision”:http://www.catb.org/~esr/writings/cathedral-bazaar/cathedral-bazaar/ar01s13.html
* Raymond’s ever-growing archive of “notes, comments, rebuttals and more”:http://www.catb.org/~esr/writings/cathedral-bazaar/ on CatB.
* Read Linus Torvalds’ fun, light and frothy autobiography about the heady young days of Linux “Just for Fun”:http://www.amazon.com/exec/obidos/ASIN/0066620732/galaxyfaraway
* And of course, there is the “The Circus Midget and the Fossilized Dinosaur Turd”:http://fringe.davesource.com/Fringe/Computers/Philosophy/Cathedral_Bazaar/Parody.html (don’t worry, it’s satire and it has a point)

The Trouble with Diebold and Electronic Voting

First, take a look at this “graphic at Washington Post”:http://www.washingtonpost.com/wp-dyn/content/graphic/2006/03/16/GR2006031600213.html sensationally titled *How to Steal an Election*. It compares Nevada laws regarding slot machines with state and federal laws regulating electronic voting machines. The comparison is stark and eye-opening. For example, the Nevada Gaming Commission has access to all software for gaming electronics but the voting machine code is a _trade secret_. Yes, and so are the inner government workings of China.

And then there is the most famous of the voting machine manufacturers, “Diebold”:http://www.diebold.com. In addition to being a closed system that even the government is not allowed to know about, it is a company that fundamentally misunderstands electronic voting in particular and security in general.

For example, “this came from a Diebold spokesman”:http://www.nytimes.com/2006/05/12/us/12vote.html?ex=1305086400&en=5b3554a76aad524a&ei=5090&partner=rssuserland&emc=rss [via Schneier]

“For there to be a problem here, you’re basically assuming a premise where you have some evil and nefarious election officials who would sneak in and introduce a piece of software,” he said. “I don’t believe these evil elections people exist.”

Ah yes, the head-in-the-sand form of security. I hear it is very popular in the real world and effective in utopian societies.

The fun and games do not end there. A “major vulnerability”:http://www.insidebayarea.com/ci_3805089 was found in the voting machines:

This newspaper is withholding some details of the vulnerability at the request of several elections officials and scientists, partly because exploiting it is so simple and the tools for doing so are widely available.

Of course, the “report appeared later on”:http://www.blackboxvoting.org/BBVtsxstudy.pdf with parts redacted, and it is tremendous.

DRM: Why Apple Has it Wrong

A few days ago, a Janet Meyer article on “Apple Matters”:http://www.applematters.com/index.php/section/comments/is-big-brother-on-your-ipod/ sparked a phenomenal discussion for and against Apple’s DRM policies. If you are not aware, Apple has a *Digital Rights Management* [DRM] system that “protects” the music it sells from the iTunes music store: it determines where you can play it, how you can play it, how many machines you can play it on and so on. It is proprietary, so if Apple controls the online music market, Apple also automatically controls how, where, why, when we listen to music we buy from them.

To cut a long story short, Janet was making the point that Apple may have a closed music format but as long as consumers have no trouble with it, consumers have a choice to buy CDs instead, the market will decide what is best.

Ah yes, the market. That all-knowing, all-seeing, all-singing, all-dancing market. It knows. In a _perfect_ society with fully informed consumers who have _true_ choices, the market knows. Elections would be marvellous with fully informed voters with _true_ choices as well Read the rest of this entry »

Linus on CNN

“Linus Torvalds is on CNN this weekend”:http://edition.cnn.com/2006/BUSINESS/05/18/global.office.linustorvalds/ and though the interview is largely unsurprising, he is always charming and self-deprecating- taking as little credit for the “revolution” as possible.

Timings for the interview on CNN International:

You can watch the Linus Torvalds interview on Global Office on CNN International at these times:
*Saturday*
03:30 ET/08:30 BST/1530 HKT/1300 New Delhi
09:30 ET/14:30 BST/2130 HKT/1900 New Delhi
*Sunday*
07:30 ET/12:30 BST/1930 HKT/1700 New Delhi
13:30 ET/18:30 BST/0130 (Monday) HKT/2300 New Delhi

April Fool’s Day 2006 for Geeks

It seemed as though yesterday was an overactive April Fool’s day; maybe because it was a Saturday or because I was actively seeking pranks. And I found many.

Some of my favorite pranks for this year: Read the rest of this entry »

Fetch Your Netflix Ratings

If you use the “Netflix”:http://www.netflix rating system even half as much as I do, your account has information documenting your entire movie watching life. At this point in time, I have rated 1348 films on Netflix and that number grows by about 15 per month- considering I am a movie geek and a statistics geek, that information is important- nay, vital– to me! If only there was a simple, friendly way to get at my information… Read the rest of this entry »