ClickCease

Big Data, Digital Footprints, and Credit: Looking To The Future

A new study by the National Bureau of Economic Research has found that credit scores aren’t the only indicator of credit risk – your digital footprint might work as well.

October 1, 2021
Prove
Learn More about the author: Big Data, Digital Footprints, and Credit: Looking To The Future
Share:

People seem to view their credit score as sacred; they’ve cultivated the number for years, check its progress every few months, and pray it doesn’t dip too low. Yet, if you want to own a home, qualify for a reasonable home insurance rate, finance a new car, or even sign an apartment lease, there’s a good chance the score serves as a gatekeeper to these experiences.

This is because companies need to quantify client risk. And credit scores have served as a solid predictive model for decades.

But a new study by the National Bureau of Economic Research has found that credit scores aren’t the only indicator of credit risk – your digital footprint might work as well. The research paper called On the Rise of FinTechs Credit Scoring using Digital Footprints by Tobias Berg, Valentin Burg, Ana Gombovic, and Manju Puri tested a different approach: instead of using credit bureau information to predict consumer credit risk, they used digital footprints.

Digital footprint variables are highly valuable for default prediction

Researchers studied a sample of over 270,000 purchases from a German e-commerce company that allows its customers to buy furniture and pay for it later. The transaction works a bit different than your average furniture sale, however. The company analyzes the customers’ digital footprint and uses traditional credit scores to determine creditworthiness.

The researchers decided to look more closely at several facets of a consumer’s digital footprint. These variables contain information stored in cookies, browsers, and online registration. The computer/browser data include device type, operating system, and browser type, while cookies contain the channel through which the customer visits the web page, time spent on the page, and check-out time. Researchers used the email address, the email provider’s name, and information about the customer’s writing from the registration footprint.

Besides the digital footprint variable, the dataset contained data from a private credit bureau that compiles a score similar to a FICO score in the United States.

The results were significant. The authors claim that digital footprint variables are highly valuable for predicting defaults.

They found that the difference in default rates between customers using iOS and Android is equivalent to the difference in default rates between a median FICO score and the 80th percentile of the FICO score.

Also, customers who use internet and phone service providers that provide services to affluent customers are about half as likely to default as the average customer; this is equivalent to the difference between a mean FICO score and a FICO score at the 10th percentile.

Email addresses are also useful in predicting defaults as customers with their names in the email address are 20% less likely to default. This is similar to the discovery of Belenzon, Chatterji, and Daley (2017), who claim that companies that carry their founder’s name have superior performance.

The customer’s behavior during the registration process and the type of websites that brought them to the registration page are also significantly related to defaults. Those coming through targeted ads were more likely to default than those who visited price comparison websites.

A model that takes into account all variables of digital footprints outperforms the information content of credit bureau scores

A model which uses all variables of digital footprints outperforms the information content of credit bureau scores. The area-under-curve of such a model is 69.6%, significantly higher than the area-under-curve of 68.3% of the FICO score.

image_preview (1)-9a664c.png

The FICO score is not available for unscorable customers, and in the dataset used for the study, ~15,591 observations were unscorable. It turned out the digital footprint variables for unscorable customers are more useful than for scorable customers as the area-under-curve for unscorable customers is 74.4% vs. 69.7% for scorable.

The authors find their results remarkable because they use simple and easily available information, but also because the results are robust to out-of-sample tests and therefore not driven by over-fitting in the sample. They are also robust to various default definitions, and various sample splits.

What’s next

In a way, the results are entirely unsurprising. A person’s digital footprint exposes their level of affluence through their browser and mobile device type. The richer a person is, the easier it will be to make monthly payments. So what?

But to me, it comes down to two questions:

1. What e-commerce or FinTech companies will decide to do with this data.

Outside of making credit risk determinations for their own companies, there’s no question that it’s incredibly valuable to other businesses. It could allow other e-commerce companies to bypass credit report pulls and other bureaucratic processes to sell or finance their products.

2. These findings are a win for business, but is it a win for the consumer? I would say that it depends on which consumer you ask.

The credit score determination process is already fraught with socioeconomic and racial bias. And, because a digital footprint is tied to shallow affluence indicators, I don’t see how using it overcomes this bias for those with already low credit scores.

The authors argue that this may help the 2 billion adults worldwide without access to any credit or formal financial services, mentioning that it will foster financial inclusion and lower inequality. However, this statement seems like a bit of a reach; those without access to credit may not have the ability to optimize their digital footprints.

But with this information, those with subprime credit scores or no credit history may be able to change their digital behavior to overcome credit denials. It’s possible that a simple switch from an Android browser to an iOS browser leads to more approvals and lower interest rates. That’s a pretty simple switch. However, this could turn into a rat race between the consumer with no clear winner in sight over time.


To learn about Prove’s identity solutions and how to accelerate revenue while mitigating fraud, schedule a demo today.


The modern
way of proving identity

Trusted by 2500+ leading companies to reduce fraud and improve consumer

Prove

Keep reading

See all blogs
Read the article: Liminal's Friday Five: Why Continuous Authentication Shouldn't Add Friction
Company News
Liminal's Friday Five: Why Continuous Authentication Shouldn't Add Friction

In this edition of Liminal's Friday Five, Prove CEO Rodger Desai shares why the future of fraud prevention depends on reducing friction without sacrificing security, using persistent digital identity, network intelligence, and continuous trust to stop AI-driven fraud. The conversation explores identity verification, authentication, customer experience, and the strategies financial institutions and digital businesses need to build secure, seamless user journeys in the AI era.

Company News
Read the article: The Silent Drain: How SMS Pumping Is Bleeding Digital Marketplaces Dry
Blog
The Silent Drain: How SMS Pumping Is Bleeding Digital Marketplaces Dry

SMS pumping fraud is silently increasing verification costs for digital marketplaces by exploiting OTP workflows. Explore how these attacks operate, why traditional SMS authentication is failing, and how proactive phone intelligence can prevent fraud before an SMS is sent.

Blog
Read the article: Prove and Baselayer Partner to Bring Real-Time Business Verification to ProveX
Blog
Prove and Baselayer Partner to Bring Real-Time Business Verification to ProveX

Prove and Baselayer simplify business verification by combining trusted identity, real-time KYB intelligence, and seamless onboarding into a single workflow without requiring additional verification steps.

Blog