Hundreds of the world's top websites routinely track a user's every keystroke, mouse movement and input into a web form — even before it's submitted or later abandoned — according to the results of a study from researchers at Princeton University.
And there's a nasty side-effect: personal identifiable data, such as medical information, passwords and credit card details, could be revealed when user's surf the web, without them knowing that companies are monitoring their browsing behaviour.
The Princeton researchers found it was difficult to redact personally identifiable information from browsing behaviour records. Even, in some instances, when users have switched on privacy settings such as Do Not Track.
The research found that third party tracking services are used by hundreds of businesses to monitor how users navigate their websites. This is proving to be increasingly challenging as more and more companies beef-up security and shift their sites over to encrypted HTTPS pages.
To work around this, session-replay scripts are deployed to monitor user interface behaviour on websites as a sequence of time-stamped events, such as keyboard and mouse movements. Each of these events record additional parameters — indicating the keystrokes (for keyboard events) and screen coordinates (for mouse movement events) — at the time of interaction. When associated with the content of a website and web address, this recorded sequence of events can be exactly replayed by another browser.
What this means is that a third person is able to see, for example, a user entering a password into an online form, which is a clear privacy breach. Websites that employ third party analytics firms to record and replay such behaviour are acting, they argue, in the name of "enhancing user experience". The more they know what their users are after, the easier it is to provide them with targeted information.
While it's not news that companies are monitoring our behaviour as we surf the web, the fact that scripts are quietly being deployed to record individual browser sessions in this way has concerned the study's co-author, Steven Englehardt, who is a PhD candidate at Princeton.
"Collection of page content by third-party replay scripts may cause sensitive information, such as medical conditions, credit card details, and other personal information displayed on a page, to leak to the third-party as part of the recording," he wrote. "This may expose users to identity theft, online scams and other unwanted behaviour. The same is true for the collection of user inputs during checkout and registration processes."
Websites logging keystrokes has been an issue known for a while to cybersecurity experts. And Princeton's empirical study raises valid concerns about users having little or no control over their surfing behaviour being recorded in this way.
So it's important to help users control how their information is shared online. But there are increasing signs of usability trumping security measures that are designed to keep our data safe online.
Yijun Yu is senior lecturer, department of computing and communications, at The Open University
Morning & Afternoon Newsletter