Selecting prospects for cross-selling financial products using

Submitted to Expert Systems with Applications

manuscript 1

Selecting prospects for cross-selling ﬁnancial products

using multivariate credibility

Fredrik Thuring, Jens Perch Nielsen, Montserrat Guill´en, Catalina Bolanc´e *

Cass Business School, City University, London, United Kingdom, Fredrik.Thuring.1@city.ac.uk (corresponding author)

Cass Business School, City University, London, United Kingdom, jens.nielsen.1@city.ac.uk

Department of econometrics, RISC-IREA, University of Barcelona, Spain, [email protected]

Insurance p olicies or credit instruments are ﬁnancial products that involve a long-term relationship between

the customer and the company. For many companies a possible way to expand its business is to sell more

pro ducts to preferred customers in its portfolio. Data on the customers’ past behaviour is stored in the

company’s data base and these data can be used to assess whether or not more products should be oﬀered to

a speciﬁc customer. In particular, data on past claiming history, for insurance products, or past information

on defaulting, for banking products, can be useful for determining how the client is expected to behave in

other ﬁnancial products. This study implements a method for using historical information of each individual

customer, and the portfolio as a whole, to select a target group of customer to whom it would be interesting

to oﬀer more pro ducts. This research can help to improve marketing to existing customers and to earn higher

proﬁts for the company.

Key words : Cross-sale selections; Financial services industry; Multivariate credibility.

1. Introduction

Cross-selling means approaching the present customers of a company and encouraging them to

increase their engagement with the company by purchasing one or many additional products. It is

one of the main tools for managers to strengthen the customer relationship (Kamakura et al., 1991).

In the ﬁnancial sector, customers have a long-term relationship with their service provider and

data on their characteristics, transactions, demographics and behaviour is stored in the company’s

data base, see Seng and Chen (2010) and Liao et al. (2011). This information can be used to select

preferred customers and cross-sell them products they do not yet possess.

* The authors acknowledge support by the Spanish Ministry of Science ECO2010-21787-C01-03.

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

2 Article submitted to Expert Systems with Applications; manuscript no. 1

We present a method that describes how to model past behaviour in one or many ﬁnancial

products in order to estimate a customer speciﬁc risk proﬁle for a certain product not yet owned

by him or her, see e.g. Bae and Kim (2010) for other examples of modelling customer behaviour.

Thereafter, the risk proﬁle estimate is used to select which customers, from the company’s portfolio,

to approach and attempt to make a cross-sale. Knowledge about past customer behaviour in one or

many ﬁnancial products is known to explain the performance in other related products, of the same

customer (see e.g. Englund et al., 2009). However, there has been no attempt to implement this

as a cross-selling marketing instrument. Our objective is to show a case study of this method and

to explain how such a system can be implemented in practice. The general procedure is described

in Figure 1 where we see, from data analysis to customer selection, how a ﬁnancial company can

select a target group of customers in order to cross-sell them a certain product. Initially, data

Figure 1 Workﬂow for cross-selling in the ﬁnancial sector

on customers with several products are analysed and a model is speciﬁed. The model predicts an

individual score for each customer with respect to a ﬁnancial product he/she does not own. In our

paper the score is called a risk proﬁle because it predicts the customer behaviour, for a particular

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

Article submitted to Expert Systems with Applications; manuscript no. 1 3

not owned product, given the individual information about the behaviour in other owned and

related products. The company can then select a target group for a marketing campaign based on

the predicted risk proﬁles, oﬀer a speciﬁc product to this group and thereafter the success of the

cross-sale campaign can be analysed to reﬁne the model, see also Malthouse (2010).

In an insurance company the method presented in Section 3 can be used to detect customers

likely to report few insurance claims, with respect to a not yet owned insurance coverage, and

cross-sell them that speciﬁc coverage at p ossibly a discounted premium level. Insurance companies

normally have models for the expected (yearly) claim frequency, given certain characteristics of the

customer and the insured object, which have been estimated based on collateral data on historic

claims reported by past and present customers of the company, see Denuit et al. (2007) for details

on claims frequency models. When predicting the claim frequency of a speciﬁc customer, such

models do not usually take into consideration the individual claims experience of that customer, but

predict the claims frequency based on a risk categorization which is a function of the characteristics

with respect to the customer and the object. Since there can be customers with more or less risk

adverse (individual) behaviour, there are cases for which the claim frequency model over-estimates

or under-estimates the claim occurrence. If, for a certain customer, the claim frequency model

over-estimates the claim occurrence the customer is reporting ”fewer claims than expected”, on

the other hand if the claim frequency model under-estimates the claim occurrence the customer

reports ”more claims than expected”. By knowing about the individual behaviour (more or less

claims than expected) in one or many of a customer’s existing coverages, a similar behaviour can

be expected for another coverage, not yet owned by the customer. For instance, someone who has

a motor insurance policy coverage and who claimed less than expected is probably also going to be

claiming less than expected in other coverages such as house insurance. This phenomenon can be

explained by the attitude that individuals have towards risk (see Slovic et al., 2004 and Harrison et

al., 2007). People that are very much risk adverse drive carefully and also maintain their houses and

belongings in good conditions. As a result, there is a correlation between the number of claims that

they report to their insurance company in two diﬀerent insurance coverages. On the other hand,

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

4 Article submitted to Expert Systems with Applications; manuscript no. 1

some individuals have a completely diﬀerent attitude towards risk, they are more aggressive when

driving and are therefore expected to be careless about their properties too. So, when cross-selling

house coverage to individuals who already have motor insurance with the company, it would be

wise to take into consideration the observed number of car claims (for the speciﬁc customer) in

comparison to the expected number of car claims. Note that the reverse is also true, the number

of past house insurance claims can help to predict future car insurance claims.

A similar argument can be made for the banking sector. Customers that have not defaulted

in the past on their loans and/or have a ﬂawless credit card payment history, are the ones also

expected to be proﬁtable for other credit instruments. As for insurance companies, banks and other

credit institutions have models and assessments for the likelihood of a customer not being able

to repay credit card loans or mortgages and the concept of ”fewer incident than expected” and

”more incidents than expected” is applicable here as well. In the proceeding, we will refer to the all

events leading to a customer induced loss for a ﬁnancial company (insurance claims, loan defaults,

non-repayment of credit card loans, etc.) as incidents.

The rest of the paper is organized as follows. In Section 2 we present the background of cross-

selling and marketing of ﬁnancial products. We show that selecting customers, based on behaviour

in other related products, is an issue that has not been explicitly discussed in existing works. Sec-

tion 2 also provides a short overview of credibility theory, which we use to estimate the individual

risk proﬁle. In Section 3 we brieﬂy show how the risk proﬁle can be obtained, in the cross-selling

case, and Section 4 presents a real case study on customers from the database of a Swedish insur-

ance company. The results illustrate how the methods can be used in practice, they show that

implementation is straightforward and can lead to substantial proﬁt improvement compared to a

strategy, for cross-selling, where customers are selected randomly. Finally, Section 5 concludes.

2. Background

We ﬁrst review recent cross-sale studies and thereafter the concept of credibility theory, which is

the technique used for evaluating cross-sell prospects in this paper.

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

Article submitted to Expert Systems with Applications; manuscript no. 1 5

2.1. Cross sale models

Understanding and using cross-selling techniques is crucially important for a company because as

the customers acquire more products from the same provider, the switching cost, associated with

leaving for a competitor, increases (Kamakura et al., 2003). Therefore, cross-selling is considered

a strong driver for lowering the customer churn, increasing the number of loyal customers and

obtaining higher customer lifetime value (Akura and Srinivasan, 2005). In addition to this, con-

sidering product features allows signiﬁcant contributions for managers striving for valuable and

strong relationship with their current customer base (Larivi`ere and Van den Poel, 2004). Another

important, but not as obvious, beneﬁt from cross-selling is that companies can learn more about

the customers’ preferences and buying behaviour (Kamakura et al., 2003) and cumulate various

types of data to their data warehouse e.g. demographic information (Ahn et al., 2011). Such infor-

mation can be used as explanatory variables to predict certain behaviours of the customers such

as customer retention and proﬁtability outcomes (Larivi`ere and Van den Pol, 2005).

Other studies focus on modeling the probability of a successful cross-sale attempt. In an early

study by Kamakura et al. (1991) probabilistic predictions are made on whether or not a customer

would purchase a particular product/service based on their ownership of other products/services.

In Knott et al. (2002), diﬀerent models are applied to predict which product a customer is expected

to buy next and the approach is further developed in Li et al. (2005), where also the appropriate

time to approach a speciﬁc customer is studied.

Even though many studies have been made on cross-selling as a method for increasing a com-

pany’s revenue, only few discuss potential heterogeneity in the proﬁtability of the cross-sale

prospects. As pointed out in Larivi`ere and Van den Pol (2005), ﬁnancial products are not the

typical grocery products such as milk, coﬀee or cookies, but products that are bought and owned

for a speciﬁc perio d in time. In addition to this, ﬁnancial products are associated with uncertain

costs which are determined at some (uncertain) time after the product is sold. Therefore it is not

guaranteed that a successful cross-sale attempt, to a speciﬁc customer, will generate proﬁt to the

company. Instead if the cross-sold product generates claims (for an insurance company) or a loan

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

6 Article submitted to Expert Systems with Applications; manuscript no. 1

default (for a lending bank) the ﬁnancial product actually generates a loss to the company, in most

cases far greater than the income at the point of sale (insurance premium or interest payment).

Englund et al. (2008) suggest that their multivariate credibility estimator could be used for evaluat-

ing cross-sale prospects by taking into account only information from the other insurance products

of these speciﬁc prospects. The resulting estimate of the risk proﬁle can be used to identify the

expected proﬁtable customers (having less than expected number of claims or loan defaults) and

hence increase the company’s total proﬁt from cross-selling.

2.2. Credibility theory

In actuarial science, credibility theory is a technique widely used to price diﬀerent insurance cov-

erage such as health, life and property insurance (Frees, 2003). In general, the idea is to weight

data, associated with an individual policyholder, with data from a collective of policyholders using

a credibility weight α,

individual estimate = α × individual data +

(

1 − α

)

× collective data .

A historical review of credibility theory starts with the papers by Mowbray (1914) and Whitney

(1918) in which the credibility weight is determined ad hoc, focusing on practical applications, and

not yet founded on concrete mathematical grounds. In B¨uhlmann (1967) (and in the more general

B¨uhlmann and Straub, 1970, where the B¨uhlmann-Straub credibility model is presented) this was

changed by viewing the determination of α as an optimisation problem where only the ﬁrst and

second order moments of the data is needed for the optimal estimator (Norberg, 2004). The gener-

alisation of the credibility estimator to higher dimensions was introduced in Jewel (1973) and later

in a multivariate hierarchical framework by Venter (1985). In Jewel (1989) the speciﬁc problem of

multivariate predictions of ﬁrst and second order are investigated, while a comprehensive reference

to (multivariate) credibility in general is B¨uhlmann and Gisler (2005). A speciﬁc interpretation

of the B¨uhlmann-Straub credibility model is found in Englund et al. (2008) and Englund et al.

(2009) where the dimensions, in the multidimensional credibility model, are interpreted as diﬀerent

insurance coverages, between which the claim occurrence can be more or less correlated.

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

Article submitted to Expert Systems with Applications; manuscript no. 1 7

3. Methodology

We use multivariate credibility theory to estimate a customer speciﬁc latent risk proﬁle and there-

after evaluate if a speciﬁc additional product, of a speciﬁc customer, is expected to contribute

positively to the proﬁt of the company, if that product is cross-sold to the customer. The proﬁt

is measured as the customer speciﬁc deviation between the a priori expected number of incidents

(insurance claims, loan defaults, etc) and the corresponding observed number. In the next para-

graphs we present the methodology brieﬂy and give reference to previous related work on the model

and estimation technique.

3.1. Estimation of the risk proﬁle

We use the standard multivariate B¨uhlmann-Straub credibility model, see e.g. B¨uhlmann and

Gisler (2005, p. 178) and Englund et al. (2008). Individuals i = 1, . . . , I are customers to a ﬁnancial

company and have been so during time periods j = 1, . . . , J

. During these time periods, every

customer has had l = 1, . . . , K diﬀerent ﬁnancial products. We alter between k, k

′

and l as index

for ﬁnancial products in general. For each customer i in time period j and product l, we have

the a priori expected number of incidents λ

ijl

= e

ijl

(

ijl

)

, which depends on the risk exposure

0 ≤ e

ijl

≤ 1, a regression function g

and of a set of explanatory variables Y

ijl

characterising the

customer and the insured object. This can be viewed as a categorisation of the customer and the

insured object into one of a large (but ﬁnite) number of risk categories. The function g

is common

for all customers i and time periods j and can be estimated, using a generalised linear model,

based on collateral data of the company. We assume that e

ijl

can take values between

[

0, 1

]

, where

ijl

= 0 means that the l-th product is not active (not owned) for customer i in time p eriod j and

correspondingly, e

ijl

= 1 means that the product l of customer i is active (owned) during the entire

time period j. We assume a Poisson distribution for the random variable N

ijl

, describing the actual

number of incidents for customer i in time p eriods j and product l. The observation of N

ijl

is n

ijl

Consider another random variable Θ

which represents hidden characteristics such as risk aver-

sion, attitude, etc. that are not captured by the explanatory variables. Θ

random variables are

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

8 Article submitted to Expert Systems with Applications; manuscript no. 1

often called the random eﬀects. Let the pairs

(

1jl

, Θ

)

(

2jl

, Θ

)

, . . . ,

(

Ijl

, Θ

) be indepen-

dent. We assume E

[

ijl

]

= λ

ijl

where E

[

]

= θ

and Cov

[

, Θ

′

]

= τ

′

for k = 1, . . . , K and

′

= 1, . . . , K. Further we assume that the conditional expectation is E

[

ijl

| Θ

= θ

]

= λ

ijl

. The

risk proﬁle θ

describes the risk that is not captured by the model for the a priori expected number

of claims, of customer i and product l, and, as mentioned above, is sometimes called random eﬀect.

We deﬁne F

ijl

as the deviation between the actual number of incidents N

ijl

and the a priori

expected number of incidents λ

ijl

and F

i·l



j=1

ijl



j=1

ijl

Other deﬁnitions, of the deviation between the expected and observed risk, are possible see e.g.

Guill´en et al. (2011). We assume that Cov

[

ijk

, F

ijk

′

| Θ

, Θ

′

]

= 0, for k ̸= k

′

The homogeneous multivariate credibility estimator (1) is the best linear unbiased estimator of

[

, . . . θ

]

′

(see Englund et al., 2009 and B¨uhlmann and Gisler, 2005, p. 181).

= θ

+ α

(

i·

− θ

)

(1)

with θ

[

, . . . θ

]

′

and F

i·

[

i·1

, . . . F

i·K

]

′

. The credibility weight α

= T Λ

(T Λ

+ S)

−1

where

T is a K by K matrix with elements τ

′

, k = 1, . . . , K and k

′

= 1, . . . , K. The matrices Λ

and S

are diagonal matrices with, respectively, λ

i·l

, l = 1, . . . , K and σ

, l = 1, . . . , K in the diagonal. The

parameter σ

= E

[

(Θ

)], where

(Θ

) is the variance within an individual customer

i, for a

product l (for further details see B¨uhlmann and Gisler, 2005, p. 81). We also refer to B¨uhlmann

and Gisler (2005, pp. 185-186) for parameter estimation procedures of the matrices S and T and

the vector θ

Performing the matrix multiplication in

(1) and considering element

k of θ

we get

= θ



′

ikk

′

(

i·k

′

− θ

′

)

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

Article submitted to Expert Systems with Applications; manuscript no. 1 9

where α

ikk

′

is element kk

′

of the matrix α

. This can be rewritten as

= θ

+ α

ikk

(

i·k

− θ

) +



′

̸=k

ikk

′

(

i·k

′

− θ

′

)

. (2)

We now assume that if product k is not active (not owned) by customer i, the risk exposure

ijk

= 0 for all j and consequently λ

ijk

= λ

i·k

= 0. It is p ossible to show that λ

i·k

= 0 implies that

ikk

= 0 and

(

)

becomes

= θ



′

̸=k

ikk

′

(

i·k

′

− θ

′

)

, (3)

where the α

ikk

′

is element kk

′

of α

when taken into consideration that λ

i·k

= 0 in Λ

Equation

(3) shows that even though a customer

i does not have an active product k, it is

possible to obtain his/her speciﬁc risk proﬁle θ

(with respect to product k) by using data of

i·k

′

i·k

′

i·k

′

with respect to the other (owned) products k

′

∈ {1, . . . , k − 1, k + 1, . . . , K}. From a

company’s perspective, customers with a low risk proﬁle are preferred and therefore the estimate

of θ

can be used to assess which customers to cross-sell product k to.

4. Empirical study

In this section we describe the data set collected to test the cross-sale selection methodology and

our experiments with this data. We require a data set describing customers who own more than

one ﬁnancial product.

We conduct the experiment by neglecting the data with respect to one of the products and

therefore imagine that this product is not owned by the customers. Instead the data for the other

products is used to investigate if we are able to identify customers with fewer (or more) than

expected number of incidents with respect to the discarded product.

4.1. Application data

The data sample is collected from the data base of a large Swedish insurance company writing

business in both personal and commercial lines, however our sample consists solely of personal lines

customers. The sample consist of a set of individuals who have been customers to the company

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

10 Article submitted to Expert Systems with Applications; manuscript no. 1

between 1999 and 2004 and who, during this time period, have owned all of the K = 3 main

insurance coverages provided: motor, building and content insurance. The customers have not

owned the coverages for equally long time so the policy duration spans between J

= 3 and J

= 6

years.

We have collected data from I = 3395 customers and for each customer i we estimate the a priori

expected number of insurance claims

ijl

= e

ijl

ˆg

(

ijl

)

(where ˆg

is estimated using a collateral

dataset from the same company) and collect the number of claims n

ijl

for each year j = 1, . . . , J

and for each of the three coverages l = 1 (motor), l = 2 (building) and l = 3 (content). The a priori

expected number of insurance claims

ijl

has been assessed with the claim frequency model ˆg

, in

force at the time, using the characteristics of each customer and insured object. We present the

mean and standard deviation of our data in Table 1, where it can be seen that the mean of the a

priori expected number of claims

ijl

is close to the mean of the observed number of claims n

ijl

with the exception for product l = 2 (building coverage). Note that the standard deviation of the

a priori expected number of claims is lower than the standard deviation of the observed number of

claims, which is the result of the random eﬀects and justiﬁes credibility estimation.

Table 1. Descriptive statistics for Swedish

insurer data from 1999 - 2004

Mean Std. dev.

Motor Expected 0.084 0.053

Observed 0.083 0.295

Building Expected 0.064 0.033

Observed 0.046 0.220

Content Expected 0.051 0.028

Observed 0.052 0.237

4.2. Experiment design and results

Our aim is to replicate the situation where the customers of a ﬁnancial company have a set of

products but lacking one of the products oﬀered by the company. We assume that the company is

interested in selecting customers with fewer than expected number of incidents. The company can

achieve this by estimating the risk proﬁle θ

for each customer i (with respect to the not owned

product k) and select those with low risk proﬁle. With our data set we imagine not knowing about

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

Article submitted to Expert Systems with Applications; manuscript no. 1 11

the data for one of the products k and thereafter estimate the risk proﬁle θ

with data from the

other products 1, . . . , k − 1, k + 1, . . . , K. Thereafter we order the data set by increasing

and

partition it into a certain number M of subsets Φ

(of size ϕ

) with m = 1, . . . , M. The estimate

of the risk proﬁle



′

̸=k

ˆα

ikk

′



i·k

′

−

′



, where

i·k

′

i·k

′

i·k

′



j=1

ijk

′



j=1

ijk

′

. (4)

The partitioning into subsets Φ

is needed for presenting the results in an understandable way,

we used diﬀerent values of M and ﬁnally concluded that M = 5 is an appropriate number of subsets.

In this way, Φ

contains 20% of the customers associated with the lowest

, Φ

contains the next

20%, etc.. The number ϕ

= 679, for m = 1, . . . 5. Since the data sample is ordered by increasing

before the partitioning into subsets Φ

, we expect to capture customers with fewest incidents,

compared to the a priori expected number, in subset Φ

and the customers with the most incidents,

in comparison to the a priori expected number, in subset Φ

. This can be validated by analysing

the observed number of claims n

i·k

in comparison to the a priori expected number λ

i·k

for the

customers in the diﬀerent subsets Φ

, with respect to the previously imagined not owned product

k. For each subset Φ

, we are interested in the deviation ∆ of the observed number of claims in

comparison to the a priori expected number expressed as a percentage as follows,

∆ (Φ

)

= 100









i∈Φ

i·k



i∈Φ

i·k

− 1







, m ∈ {1, 2, 3, 4, 5}. (5)

Figure 2 describes our experiment with the data, for the situation where we are interested in

identifying subsets Φ

for product 2, using data from products 1 and 3. We use the notation

i,213

meaning that the risk proﬁle θ

is estimated using data from products 1 and 3.

It is not uncommon that some customers of a ﬁnancial company only have one of the many

products oﬀered by the company. The presented methodology works in this speciﬁc case as well by

setting e

ijk

= 0 for the all products k which the customers does not own. I.e. for our data sample,

we can also estimate the risk proﬁle θ

by using information from only one of the two remaining

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

12 Article submitted to Expert Systems with Applications; manuscript no. 1

Figure 2 The design of the experiment for the particular case of creating subsets Φ

to Φ

based on the

estimated risk proﬁle for product 2 using information from products 1 and 3.

products in the data set. For instance, for the estimate of the risk proﬁle of product k = 1, θ

, we

use the notation

i,12

if only data from product 2 is used in the estimation, and correspondingly

for the other products.

The evaluation criteria

(5), applied to investigate the deviation between of the observed number

of claims n

i·k

and the a priori estimated expected number

i·k

, in the 5 subsets Φ

, is presented

in Figures 3 to 5. In Figure 3, we imagine that product k = 1 (car coverage) is not owned by

the customers and we use data from either product k

′

= 2 (building coverage) or product k

′

= 3

(content coverage) or data from both building and content coverage to estimate the risk proﬁle

, with respect to product 1. Thereafter, for each of the three diﬀerent estimators, we order the

data set by increasing value of the risk proﬁle estimate and partition the data into the subsets

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

Article submitted to Expert Systems with Applications; manuscript no. 1 13

with m = 1, . . . , 5 for calculation of ∆

(Φ

), see equation (5).

As seen in Figure 3, the credibility estimator

i,12

, which uses data from product 2, does only

slightly diﬀerentiate the customers with respect to claiming (n

i·1

) in comparison to the a priori

expected claiming (λ

i·1

) (left sub-ﬁgure of Figure 3). However, when ordering the data with respect

i,13

, which uses information from product 3, subset Φ

contains customers with on average 6%

lower claims frequency than expected and subset Φ

contains customers with 22% more claims

than exp ected, see center sub-ﬁgure of Figure 3. When using data from both product 2 and 3 the

result is improved slightly and Φ

contains customers with 8% less claims than expected and Φ

contains customers with 26% more claims than expected.

1 2 3 4 5

Ordering by increasing θ

i,12

∆(Φ

) (%)

−20 −10 0 10 20 30

−1 −3 −5 2 2

1 2 3 4 5

Ordering by increasing θ

i,13

∆(Φ

) (%)

−20 −10 0 10 20 30

−6 −11 −6 −8 22

1 2 3 4 5

Ordering by increasing θ

i,123

∆(Φ

) (%)

−20 −10 0 10 20 30

−8 −5 −6 −13 26

Figure 3 Average deviation between observed number of claims and a priori expected number with respect to

product 1 (car coverage). The subsets Φ

are created using only information from building coverage (left

sub-ﬁgure), using only information from content coverage (center sub-ﬁgure) or using information from both

building and content coverages (right sub-ﬁgure)

In Figure 4 we imagine that product 2 (building coverage) is not owned by the customers. We see

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

14 Article submitted to Expert Systems with Applications; manuscript no. 1

that almost all subsets Φ

contain customers with fewer claims than expected because (according

to Table 1) the average value of

i·2

is far greater than the average value of n

i·2

since almost all

customers have reported fewer claims than a priori expected. Still, the credibility estimators

i,23

(center sub-ﬁgure) and

i,213

(right sub-ﬁgure) is able to diﬀerentiate between subsets containing

customers with less than expected claiming and more than expected claiming.

1 2 3 4 5

Ordering by increasing θ

i,21

∆(Φ

) (%)

−80 −60 −40 −20 0 20

−41 −28 −23 −21 −25

1 2 3 4 5

Ordering by increasing θ

i,23

∆(Φ

) (%)

−80 −60 −40 −20 0 20

−43 −40 −35 −36 6

1 2 3 4 5

Ordering by increasing θ

i,213

∆(Φ

) (%)

−80 −60 −40 −20 0 20

−45 −36 −29 −36 3

Figure 4 Average deviation between observed number of claims and a priori expected number with respect to

product 2 (building coverage). The subsets Φ

are created using only information from car coverage (left

sub-ﬁgure), using only information from content coverage (center sub-ﬁgure) or using information from both car

and content coverages (right sub-ﬁgure)

In Figure 5, we imagine that product 3 (content coverage) is not owned by the customers. We

see that all credibility estimators (

i,31

i,32

i,312

) are identifying the customers in subset Φ

having much more claims than exp ected. Especially the estimator

i,312

(right sub-ﬁgure) is able to

identify, in the subset Φ

, customers who have on average 64% more claims than a priori expected

while also identifying the customers in the subset Φ

with on average 10% less claims than a priori

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

Article submitted to Expert Systems with Applications; manuscript no. 1 15

expected.

1 2 3 4 5

Ordering by increasing θ

i,31

∆(Φ

) (%)

−40 −20 0 20 40 60

−3 −13 −14 12 31

1 2 3 4 5

Ordering by increasing θ

i,32

∆(Φ

) (%)

−40 −20 0 20 40 60

2 −7 −28 −15 48

1 2 3 4 5

Ordering by increasing θ

i,312

∆(Φ

) (%)

−40 −20 0 20 40 60

−10 −27 −19 −3 64

Figure 5 Average deviation between observed number of claims and a priori expected number with respect to

product 3 (content coverage). The subsets Φ

are created using only information from car coverage (left

sub-ﬁgure), using only information from building coverage (center sub-ﬁgure) or using information from both car

and building coverages (right sub-ﬁgure)

In Figures 3 to 5 it would be expected and preferred that the deviation of the observed number

of claims in comparison to the a priori expected number, ∆

(

)

, would be lowest for m = 1.

However, this is not the case for many of the estimators and especially for cross-selling product

k = 1 (car) in Figure 3 the lowest ∆

(

)

is recorded for m = 3, m = 2 and m = 4 for the credibility

estimators

i,12

i,13

and

i,123

, respectively. A similar note can be made with regards to Figure

5. We draw the conclusion that for the collected data sample it is more eﬃcient to identify a

small group of customers to avoid to cross-sale to (Φ

) than a small group of customers to target

(Φ

). Consequently, we ﬁnd that by avoiding the 20% of the customers associated with the highest

risk proﬁle estimates

(Φ

) and targeting the remaining 80% the company would increase its

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

16 Article submitted to Expert Systems with Applications; manuscript no. 1

proﬁt signiﬁcantly. In Table 2 we compare ∆

(Φ

) to ∆ (

∪

m=1

) = ∆ (Φ

∪ Φ

) where

∪ Φ

m+1

denotes the union of Φ

and Φ

m+1

Table 2. Percentage deviation between observed and expected number of claims. Note

that a positive value indicates that the subset of customers has reported more claims

a priori expected.

Car Building Content

Order ∆

(

∪

m=1

)

∆

(

)

Order ∆

(

∪

m=1

)

∆

(

)

Order ∆

(

∪

m=1

)

∆

(

)

Random 0% -4% Random -29% -30% Random 4% 3%

i,12

-2% 2%

i,21

-30% -25%

i31

-4% 31%

i,13

-8% 22%

i,23

-40% 6%

i32

-9% 48%

i,123

-8% 26%

i,213

-38% 3%

i312

-13% 64%

Table 2 shows that by selecting the 80% (∪

m=1

) most favorable customers, with respect to

the estimate of the risk proﬁle θ

, the company is able to avoid customers associated with up to

64% more claims than a priori expected (content coverage, product 3). In the table we have also

included results produced when the data sample has been randomly ordered and partitioned into

80% of the data and 20% of the data. The random order does not diﬀerentiate between subsets of

customers with respect to percentage deviation between observed and expected number of claims.

We see a similar pattern for product 1 (car) where the 80% most favorable customers are associated

with 8% less claims than a priori expected while the remaining 20% are associated with 26%

more claims than a priori expected. The performance of the credibility estimators in Product 2

(building) is diﬃcult to interpret because almost all customers are associated with lower observed

claim occurrence than a priori expected. However, even for this particular situation a subset Φ

can

be identiﬁed consisting of customers with on average 6% more claims than a priori expected. Note

that this is not received for the credibility estimator which uses all available information (

i,213

)

but the estimator which only uses data from the content product k

′

= 3,

i,23

5. Discussion

This study investigates identiﬁcation of customers to whom additional pro ducts should be oﬀered,

by estimating a customers speciﬁc risk proﬁle with the use of behavioural data from other products

of the speciﬁc customers. We use a standard multivariate credibility model applied to a portfolio

of customers, of a ﬁnancial company, owning several ﬁnancial products from the company. The

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

Article submitted to Expert Systems with Applications; manuscript no. 1 17

model allows us to take into consideration the possible (positive) correlation in customer behaviour

between diﬀerent ﬁnancial products and estimate the customer speciﬁc risk proﬁles, for a speciﬁc

product not owned by the customer, without having observed any customer speciﬁc information

with respect to that particular product. Instead, data on customer behaviour, with respect to the

other (owned) products, is the only necessity for estimating the risk proﬁle.

The methodology uses only two observables: the a priori expected number of incidents and

the observed number of incidents. We assume that the ﬁnancial company has a model for the

expected number of incidents or is able to assess a value speciﬁc for each customer or category

of customers. When estimating such models it is unusual to incorporate information about the

number of incidents related to a speciﬁc customer. Instead the company ﬁnds patterns which can

be used to categorise the customers, with respect to the expected occurrence of incidents, based

on customers’ characteristics. It is not uncommon that customers are associated with more or less

number of incidents, than suggested by the categorisation, based on their attitude towards risk.

In our methodology we use that the attitude towards risk seems to be similar across diﬀerent

ﬁnancial products. I.e. if a customer is associated with more or less number of incidents, than a

priori expected in some products, it is likely that this pattern will also emerge in other related

products.

With the presented credibility estimators we are able to assign, to each customer, a speciﬁc

estimate of his/her risk proﬁle based on data which the company has available. We use the estimate,

of each customer’s risk proﬁle, to identify subsets from the data containing customers associated

with more or less incidents than a priori expected. In this way the company receives knowledge

about which customer to target for cross-selling and which to avoid.

In our empirical study we analyse our methodology on real data from a large Swedish insurance

company, consisting of personal lines customer with three diﬀerent insurance coverages. We ﬁnd

that there are subsets of the data sample with large heterogeneity with respect to claiming in

comparison to expected claiming. Furthermore, we ﬁnd that these subsets are identiﬁable by using

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

18 Article submitted to Expert Systems with Applications; manuscript no. 1

an appropriate credibility estimator of the risk proﬁles. The appropriateness of a speciﬁc credibility

estimator is dependent of the considered product, but in most cases an estimator which uses

all available information is preferable. We ﬁnd that it is easier to identify the 20% of the data

containing customers to avoid than the 20% of the data containing customers to target. In fact,

by targeting all customers but the worst 20%, the company could expect a subset of customer

associated with less claims than a priori expected indiﬀerent of which product is considered. The

remaining 20% of the data sample consist of customers with up to 64% more claims than a priori

expected.

References

Ahn, H., Ahn, J.J., Oh, K.J. and Kim, D.H. (2011). Facilitating cross-selling in a mobile telecom

market to develop customer classiﬁcation model based on hybrid data mining techniques. Expert

Systems with Applications, 38(5), 5005-5012.

Akura, M.T. and Srinivasan, K. (2005). Research Note: Customer Intimacy and Cross-Selling

Strategy. Management Science, 51(6), 1007-1012.

Bae, J., K. and Kim, J. (2010). Integration of heterogeneous models to predict consumer behavior.

Expert Systems with Applications, 37(3), 1821-1826.

B¨uhlmann, H. (1967). Experience rating and credibility, Astin Bulletin, 4, 199-207.

B¨uhlmann, H. and Gisler, A. (2005). A Course in Credibility Theory and its Applications (Berlin,

Germany: Springer Verlag).

B¨uhlmann, H. and Straub, E. (1970). Glaubw¨urdigkeit f¨ur Schadens¨atze, Bulletin of Swiss

Association of Actuaries, 70, 111-133.

Denuit, M., Marechal, X., Pitrebois, S. and Walhin, J.F. (2007) Actuarial modelling of claim

counts: risk classiﬁcation, credibility and bonus-malus systems John Wiley and Sons, Ltd. New

York.

Englund, M., Guill´en, M., Gustafsson, J., Nielsen, L.H. and Nielsen, J.P. (2008). Multivariate

latent risk: A credibility approach, Astin Bulletin, 38, 137-146.

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

Article submitted to Expert Systems with Applications; manuscript no. 1 19

Englund, M., Gustafsson, J., Nielsen, J.P. and Thuring, F. (2009). Multidimensional credibility

with time eﬀects - an application to commercial business lines, The Journal of Risk and Insurance,

76(2), 443-453.

Frees, E.W. (2003). Multivariate credibility for aggregate loss models. North American Actuarial

Journal, 7(1), 13-37.

Guill´en, M., P´erez-Marn, A.M. and Alca˜niz, M. (2011). A logistic regression approach to estimat-

ing customer proﬁt loss due to lapses in insurance. Insurance Markets and Companies: Analyses

and Actuarial Computations, 2 (2), Forthcomming.

Harrison, G.W., Lau, M.I. and Rutstrm, E.E. (2007). Estimating risk attitudes in Denmark: A

ﬁeld experiment. Scandinavian Journal of Economics 109(2), 341-368.

Jewel, W.S. (1973). Multidimensional credibility. Actuarial Research Clearing House 4.

Kamakura, W.A., Ramaswami, S., and Srivastava R. (1991). Applying latent trait analysis in the

evaluation of prospects for cross-selling of ﬁnancial services. International Journal of Research in

Marketing, 8, 329-349.

Kamakura, W.A., Wedel, M., de Rosa, F., and Mazzon, J.A. (2003). Cross-selling through database

marketing: a mixed data factor analyzer for data augmentation and prediction. International

Journal of Research in Marketing, 20, 45-65.

Knott A., Hayes, A., and Neslin, S.A. (2002). Next-product-to-buy models for cross-selling

applications. Journal of Interactive Marketing, 16(3), 59-75.

Li, S., Sun, B., and Wilcox, R.T. (2005). Cross-selling sequentially ordered products: an application

to consumer banking services. Journal of Marketing Research, 42, 233-239.

Liao, S-H., Chen, Y-J. and Hsieh, H-H. (2011). Mining customer knowledge for direct selling and

marketing. Expert Systems with Applications, 38(5), 6059-6069.

Larivi`ere, B. and Van den Poel, D. (2004). Investigating the role of product features in preventing

customer churn, by using survival analysis and choice modeling: The case of ﬁnancial services.

Expert Systems with Applications, 27(2), 277-285

Larivi`ere, B. and Van den Poel, D. (2005). Predicting customer retention and proﬁtability by

Thuring, Nielsen, Guill´en, Bolanc´e: Selecting prospects for cross-selling

20 Article submitted to Expert Systems with Applications; manuscript no. 1

using random forest and regression techniques. Expert Systems with Applications, 29(2), 472-484.

Malthouse, E.C. (2010). Accounting for the long-term eﬀects of a marketing contact. Expert

Systems with Applications, 37(7), 4935-4940.

Mowbray, A.H. (1914). How extensive a payroll exposure is necessary to give a dependable pure

premium? Proceedings of the Casualty Actuarial Society, 1, 24-30.

Norberg, R. (2004). Credibility theory. Encyclopedia of Actuarial Science. John Wiley & Sons.

398-406.

Seng, J-L., and Chen T.C. (2010). An analytical approach to select data mining for business

decision. Expert Systems with Applications, 37(9), 8042-8057.

Slovic, P., Finucane, M., Peters, E., and MacGregor, D.G. (2004). Risk as Analysis and risk as

feelings: Some thoughts about aﬀect, reason, risk and rationality. Risk Analysis, 24(2), 311-322.

Whitney, A.W. (1918). The theory of experience rating Proceedings of the Casualty Actuarial

Society, 4, 274-292.