Enter your email address:

Delivered by FeedBurner

AddThis Social Bookmark Button

Support


Books To Read

July 2009

Sun Mon Tue Wed Thu Fri Sat
      1 2 3 4
5 6 7 8 9 10 11
12 13 14 15 16 17 18
19 20 21 22 23 24 25
26 27 28 29 30 31  

« JOURNAL: A new generation | Main | FAILED STATES vs. ROGUE STATES »

Monday, 02 August 2004

DATAMINING THE TERROR WEB

INTELLIGENT DEFENSE: mark_lastThe Internet and the WWW are critical to terrorist/global guerrilla communications. On the 8th of August, I will talk with Dr. Mark Last of Ben-Gurion University of the Negev in Israel. He and his team are working on a project to apply advanced data mining techniques to unearth terrorist activity on the Internet.

Dr. Last co-authored a paper, "Using Data Mining Techniques for Detecting Terror-Related Activities on the Web" (PDF) that provides detail on the approach. This proposed system works in conjunction with an ISP (with or without their permission) to determine potential terrorist activity. If applied to ISPs (and potentially Internet Cafes) across the Middle East, it might provide a means of detecting patterns of behavior that may be used to prevent attacks.

Here is Dr. Last's biography and a list of papers he has published.

Here's the audio of the conversation. Please check the Intelligent Defense site for updates on the program.

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d83451576d69e200d8343039f353ef

Listed below are links to weblogs that reference DATAMINING THE TERROR WEB:

Comments

Interesting, but I have serious doubts that this approach can actually work in practice.

Who exactly makes the decision as to what constitutes a "terrorist website" and what is free speech ?

It is difficult enough tuning an Intrusion Detection System or an Intrusion Prevention System to understand what is "normal" behavior within a small network of real users connected to the internet.

Trying to apply similar techniques at the scale of a whole ISP or of multiple ISPs or in multiple countries, seems to be a hugely expensive and intrusive endeavour. Not even the communist Chinese or the Saudia Arabian national level content filters seem to be able to achieve this in their attempts to censor various well known (fixed) political and news websites.

It is the sheer amount of internet traffic that foils attempts to classify or analyse
"suspicious" behavior by a tiny minority, who are highly motivated to obscure their activities.

Even conducting "traffic analysis" of who accesses suspect websites and when, is fraught with issues of scale and cost.

The UK Government reluctantly modified its original "voluntary" Data Retention scheme, which it originally planned to force ISPs to retain , amongst many other categories of data, web server access logs for 12 months beyond the time which they needed it for normal business purposes.

After much discussion, this figure was revised to something more reasonable technically and financially: - 4 days.

Neither the UK Government nor the UK ISP industry can be accused of being "soft" on terrorist or child porn websites or the people who try to access them, but the task is just too difficult and expensive.

Classifying internet users simply into "terrorists" or "non-terrorists" is wrong, especially if the intention is to build an automatic system which has potentially dire consequences for the False Positives.

Content filtering "nanny ware" is easily fooled, and the same techniques would apply to this datamining approach, without even the difficulty of having a "nanny ware" snooping filter on the client PC being used to access the suspect websites.

What if thee terrorists or their supporters use Secure Sockets Layer (SSL) or Transport Layer Security (TLS) which end to end encrypts the clickstream (necessary for credit card e-commerce etc.) and is built into almost every web browser.

Even simple password or cookie registration based access to private areas or discussion forums etc. will prevent accurate cluster analysis by preventing the suspect material from being analysed in the first place.

What if instead of static web pages, the content is dynamically generated by a back end database ?

Surely the "Access Vector" hash function of the content of the web pages is trivial to fool, using existing readily available programs which insert random dictionary text into HTML tags or into hidden non-displaying text (e.g. black text on a black background), in order to fool Baysian email spam filters ?

What if the ISP that is being used to access the alleged "terrorist web sites" uses proxy servers ?

What if the people accessing these dubious websites use the well established techniques of chains of proxy servers or of a list of open proxy servers which changes every few seconds or every web page access, as used by spammers etc. ?


Great questions WTWU. Maybe some could be asked during the interview?

Data mining is great for marketing where false positives are a mere nuisance, but here...

"What if instead of static web pages, the content is dynamically generated by a back end database?"

I think a few of the AQ sites actually are database driven. And yes, ME users are likely to be familiar with using proxy servers, as they are accustomed to getting around filters for other reasons.

The comments to this entry are closed.

My Photo

On Brave New War

  • Purchase Brave New War
  • New York Times Op-Ed
    ...a fast, thought-sparking book.. -- David Brooks
  • Greenpeace
    I read it twice and bought six copies for my friends -- John Passacantando (Exec. Dir. Greenpeace)
  • G. Gordon Liddy Show (radio)
    ...this is a seminal book in the truest sense of the term.. way ahead of the curve... go out and buy it right now -- G. Gordon Liddy
  • City Journal
    Robb has written an important book that every policymaker should read -- Glenn Reynolds (Instapundit)
  • Small Wars Journal
    Without reservation Brave New War is for professional students of irregular warfare and for any citizen who wants to understand emerging trends and the dark potential of 4GW -- Frank Hoffman
  • Scripps Howard News Service
    A brilliant new book published by terrorism expert John Robb, titled "Brave New War," hit stores last month with virtually no fanfare. It deserves both significant attention and vigorous debate... - Thomas P.M. Barnett
  • Chet Richards DNI
    John has produced an important book that should help jar the United States and other legacy states out of their Cold War mindset. You can read it in a couple of hours – so you should read it twice...
  • Washington Times / UPI
    Robb correctly finds the antidote to 4GW not in Soviet-style state structures such as the Department of Homeland Security, but in decentralization -- William Lind (the father of 4th generation warfare).
  • Robert Paterson
    Having painted a crystal clear picture of how a war of networks is playing out, he comes to an astonishing conclusion that I hope he fills out in his next book.
  • The Daily Dish
    John Robb of Global Guerrillas has written the most important book of the year, Brave New War. - Daily Dish (The Atlantic)
  • Simulated Laughter
    Well-written. Brave New War reads more like an action novel than a ponderous policy book. - Adam Elkus
  • FutureJacked
    Go buy a copy of this book. Now. If you are low on cash, skip a few lunches and save up the cash. It is worth it. - Michael Flagg
  • ZenPundit
    The second audience is composed of everyone else. Brave New War is simply going to blow them away. - Mark Safranski
  • Haft of the Spear
    There aren’t a lot of books that make me recall a 12-year-old self aching for the next issue of The Invincible Iron Man to hit the shelves. Well done. - Michael Tanji
  • Ed Cone
    His book posits an Army of Davids -- with the traditional nation state in the role of Goliath. - Ed Cone (Ziff Davis)
  • The Newshoggers
    I highly recommend reading and re-reading this work. - Fester
  • Shloky.com
    This is the first real text on next generation warfare designed for the general population and it sets the bar high for following acts. It is smart, it is a short read, and it will change your thinking. - Shlok Vaidya
  • Politics in the Zeros
    I suggest this is something Lefties need to start thinking about now, as that decentralized world is coming. - Bob Morris
  • Hidden Unities
    A thoughtful book that should be read more widely than the latest Tom Friedman whopper, Chalmers Johnson scare tale or Bill Kristol hack fest. - EB

Stats


Stats2