- Lenders: (1) lender-loan data where each row is a contribution towards a loan made by a particular lender, repeated rows for each lender, N = ~4 million (there are ~500K lenders, who have about 8 contributions to loans on average) (2) random sample from (1), start with n = 10K, if file (5) below is too large, tweak down (3) lender description data with detailed text on why each lender lends on Kiva, N = ~500K - Loans: (4) loan repayment data with repeated rows for each loan, N = ~1.28 million (there are ~100K loans with many payments on each) (5) sample of loan repayment data from (4) that corresponds to loans listed in (2), goal is to keep n < 150K (6) loan description data with detailed text on why each loan is requested - Partners: (7) information on local NGOs, N = ~150