Conference: International Conference on Telecommunication and Networks

Bank loan analysis using customer usage data: A big data approach using Hadoop

Abstract

Rapid economic development has sharply increased customer demand for personal loans, while borrower behavior remains uncertain and fuzzy in nature. Credit risk is a major challenge for both lenders and borrowers, and it directly or indirectly affects the reliability of banks. This article concentrates on the threat posed by granting loans to customers and on the associated risk to investors. Its objective is to analyze the credit risk and loan performance of "Lending Club", one of the biggest marketplaces for online credit. Loan performance and credit risk are analyzed on a large dataset with 112 attributes, collected from Lending Club for the period 2012 and 2016. We take a Hadoop approach, applied through Cloudera software, an open source platform for analyzing data. It supports the Hadoop ecosystem, which is used for managing, storing and analyzing large volumes of data. We use Hive, a data warehouse system, to manage and analyze the data stored in HDFS (Hadoop Distributed File System) using HiveQL. To understand the performance of the bank loans, we performed various analyses on the collected dataset.
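
The abstract does not list the queries that were run, so the following is only a minimal sketch of the kind of grouped aggregation a HiveQL analysis of loan performance might involve. It uses Python's built-in `sqlite3` module as a stand-in for Hive so that it runs without a cluster. The table name `loans`, the column names `grade`, `loan_amnt` and `loan_status`, and all rows are assumptions made up for illustration, not the paper's data or schema.

```python
import sqlite3

# Toy rows invented for illustration only; they are NOT Lending Club data.
# The column names (grade, loan_amnt, loan_status) are assumptions.
ROWS = [
    ("A", 10000.0, "Fully Paid"),
    ("A", 8000.0, "Fully Paid"),
    ("B", 12000.0, "Charged Off"),
    ("B", 6000.0, "Fully Paid"),
    ("C", 15000.0, "Charged Off"),
    ("C", 5000.0, "Charged Off"),
]

# Plain SQL aggregation. A HiveQL query over a table stored in HDFS
# would use much the same SELECT / GROUP BY shape.
QUERY = """
SELECT grade,
       COUNT(*) AS n_loans,
       AVG(loan_amnt) AS avg_amount,
       AVG(CASE WHEN loan_status = 'Charged Off' THEN 1.0 ELSE 0.0 END)
           AS charged_off_rate
FROM loans
GROUP BY grade
ORDER BY grade
"""


def summarize_by_grade(rows):
    """Load rows into an in-memory table and return per-grade statistics."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE loans (grade TEXT, loan_amnt REAL, loan_status TEXT)"
        )
        conn.executemany("INSERT INTO loans VALUES (?, ?, ?)", rows)
        return conn.execute(QUERY).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for grade, n_loans, avg_amount, rate in summarize_by_grade(ROWS):
        print(grade, n_loans, avg_amount, rate)
```

Each returned tuple holds a grade, its loan count, its average loan amount, and the share of loans marked as charged off. In a Hive setup the same statement would be submitted against a table backed by files in HDFS rather than an in-memory database.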