Ted Leung on the air : Attention information is personal information

Ted Leung on the air
Ted Leung on the air: Open Source, Java, Python, and ...

Fri, 08 Apr 2005

Attention information is personal information

Various people have been discussing the "attention problem" and attention.xml. The basic idea is that the advent of RSS means that we have too much information competing for our attention, and that we need a way to record "attention" data so which could then be used to perform triage of information to be presented to a human user.

The notion of automated triage of information has been around for a while. To an old Usenet junkie like me, the information overload problem and the need for tools to help triage the flow seems like a no brainer. I remember when s(coring)trn newsreader came out (as a set of patches to trn), and when Gnus, the mother of all news readers made its debut. Gnus was one of the first platforms for collaborative filtering, GroupLens, which then went on to become NetPerceptions (which now appears to be defunct). A lot of what is being discussed feels familiar, in concept if not in actual implementation. So what's new here?

Well, strn and Gnus were not instrumented to record attention data. Although I have some doubts about the accuracy of some of the data (like how long did a user read a post), Steve Gillmor seems quite excited about instrumenting clients (like Firefox) in order to obtain this data. Assuming that you could gather meaningful (or mostly meaningful) data, this seems like a good source of data for triage.

Next, you have the notion of distributing / combining / syndicating / bartering / selling the attention data. If you want to do this, having a standardized format for encoding that data seems good, and since adding XML to data always makes things better, you get attention.xml. This part is easy, and most people agree on that. There's a question about where attention data lives and who gets access to it. This is an important question, at least to me. If people are unscrupulous about the (comparatively) small amount of information that I give out about myself, what will happen when they could get their hands on a detailed model of my attention? The thought of storing my attention data in somebody's VC backed server farm doesn't give me the warm fuzzies.

After that, we get to actually using the attention data to perform triage, which is where there is room for experimentation, variation, and market based competition (at least if you believe in exposing your attention stream). Here's where you get into scoring, bayesian filtering, collaborative filtering, reputational filtering and so forth. It's also where you have to deal with issues of granularity, i.e. single posts versus conversations. It's also where you get into potentially innovative presentations as well.

From where I sit, my attention data, my reputation assignments, my triage preferences and so forth are all part of my personal information, and would be something that I would like a personal information manager to manage. I think we have one of those lying around here somewhere...

[00:07] | [computers/internet/microcontent] | # | TB | F | G | 3 Comments |

Hi Ted, nice roundup. I love the idea of 'triage'.

My introduction to commercial collaborative filtering came via NetPerceptions (I had no idea that came out of GroupLens, thanks for the history lesson). IIRC, NetPerceptions provided the core basis for Amazon's own collaborative filtering engine years ago (but which I believe is a very different beast now).

I'm curious tho'; does the subject of automatating the mediating and filtering of information come up around Chandler that much? Or is mainly about providing classification and beter user interfaces?
Posted by Bill de hOra at Fri Apr 8 05:38:05 2005

Regarding the storage and retrieval of Attention.xml, imagine a world where different companies will compete for the ability to retrieve and augment your data with their own area of expertise. You could store your Attention.xml on your personal server or with your web host, but companies could entice you to part with the data for extra information to make your life more useful.

Want to sort your attention index by the authority of the author? Technorati can help with that? Want to sort by the eBay reputation of the author? eBay can assist there. Want to see what other people in your designated social network are paying attention to? A social networking site can assist there.

We make daily choices about the trade-off between convenience and privacy. Attention.xml will be no different but it will have to provide a tangible benefit to the user to part with his or her data.
Posted by Niall Kennedy at Sat Apr 9 19:23:28 2005

Bill,

Yes, the topic does come up. I bring it up all the time ;-). There probably will not be much in 1.0 around those features -- there's too much basic functionality that we need to get working. But we are definitely thinking about those problems, and I am personally highly interested and motivated (1200 RSS feeds worth of motivation) to do some work in that area.

Niall,

Sure, I believe that obtaining ranking information from another reputational authority is a highly useful thing to do. I just want to do it by presenting as narrow a profile to those authorities as possible. All the reputational stuff is (in my mind) highly related to work on digital identity systems. Lots of people think that identity is important for varying reasons. I think it's important because it establishes a foundation for building reputation services.

I think that we're very early in this process and that there is going to be an incredible amount of experimentation (I can't wait to start).
Posted by Ted Leung at Sat Apr 9 23:01:42 2005

You can subscribe to an RSS feed of the comments for this blog:

Add a comment here:

You can use some HTML tags in the comment text:
To insert a URI, just type it -- no need to write an anchor tag.
Allowable html tags are: <a href>, , , , <blockquote>,  , , <code>, <pre>, <cite>,  and .

You can also use some Wiki style:
URI => [uri title]
 => _emphasized text_
 => *bold text*
Ordered list => consecutive lines starting spaces and an asterisk

<	April 2005					>
Su	Mo	Tu	We	Th	Fr	Sa
					1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30