首页> 外文学位 >Construction legal support for Differing Site Conditions (DSC) through statistical modeling and machine learning (ML).
【24h】

Construction legal support for Differing Site Conditions (DSC) through statistical modeling and machine learning (ML).

机译:通过统计建模和机器学习(ML)为差异站点条件(DSC)提供建设法律支持。

获取原文
获取原文并翻译 | 示例

摘要

Construction is one of the industries with a major contribution to the nation's economy. It is estimated that the world construction market has reached US ;Construction disputes are ultimately resolved in courts unless a private construction contract calls for other resolution mechanisms. In fact, some in the construction industry prefer litigation; however, their preference comes at great cost. Despite the numerous advantages of litigation, which includes being the most formal and binding process, it has two main shortcomings, which make the process undesirable and unsupportive of the growth and development of the construction industry. First, depending on the jurisdiction, complex construction disputes may take anywhere from two to six years before they reach trials. Second, the prolonged, detailed, factual discovery process makes litigation very expensive due to the need for specialized personnel with extensive legal knowledge and construction experience, a combined skill set that is not widely available in the industry. In order to overcome these major drawbacks that impact the construction industry's advancement and contribution to the nations' economy, legal decision support systems are needed to effectively and efficiently mitigate these shortcomings and in turn allow for better control and management of construction projects.;In construction disputes the initiation of the conflict can be attributed to a number of reasons including: change orders, escalation, and differing site conditions, etc. Each of these reasons leads to a separate method for addressing and handling the disputes and accordingly, each reason can be considered as a different dispute type. Among these types, one of the most important and frequently occurring disputes is Differing Site Conditions (DSC) which results from contractors encountering conditions materially different from those expected or described by the owner. This warrants special attention to this kind of dispute due to their potential for deviating construction projects from their planned time and cost.;A number of researchers in Artificial Intelligence (AI) fields have developed tools and methodologies for modeling judicial reasoning and predicting the outcomes of construction litigation cases in an attempt to provide the above mentioned decision support capabilities. Despite the significant contributions of these systems to the advancement of legal decision support capabilities in construction, their success was limited because they were not based on a detailed analysis of legal concepts that govern litigation outcomes.;Consequently, the objective of this dissertation is to provide a coherent and integrated methodology for construction legal decision support for Differing Site Conditions (DSC) disputes through statistical modeling and machine learning. To attain this goal, the current study designed and implemented a 4 step methodology targeting the following goals: (1) to extract a comprehensive set of legal factors that govern DSC litigation outcomes in the construction industry; (2) to devise a litigation prediction model for DSC disputes in the construction industry based on the extracted set of legal factors; (3) to create a methodology for automated extraction of significant legal factors that governs DSC litigation outcomes from case documents; and (4) to develop an automated retrieval model for identifying DSC precedent cases from a large corpus based on similarity to newly introduced ones. The 4 steps of this methodology were implemented incrementally, and each step relied on the outcome of its predecessor.;First, a comprehensive set of significant legal factors that govern DSC litigation cases verdicts were extracted through statistical modeling. Binary Probit and Logit Choice Models were developed (a) to identify the effect of each extracted factor on the prediction of the winning party; (b) to identify the best combination of factors with the highest significance on the prediction model; and (c) to perform a sensitivity analysis to prioritize the most significant legal factors. Among the main findings of this step are (1) in general, cases in which the Federal Government is a party of the dispute, judgments are in favor of the government (owner) over contractor; (2) "the presence of evident facts that the encountered conditions caused a change in the nature and cost of the contract" had the highest impact among variables causing a decrease in the prediction of judgment in favor of the owner, and causing an increase of 17.77% in prediction on favor of the contractor; (3) "the presence of evident facts that the specifications included a warning against the presence of DSC from those conveyed in the contract documents" caused the highest increase in the prediction of judgment in favor of the owner amounting to an increase of 56.56%; and (4) the development of Binary Probit and Logit Choice Models extracted a joint set of 13 statistically significant legal factors related to DSC disputes in the construction industry. This set provided the grounds for the other three steps of the current research methodology.;Second, an automated litigation prediction model for DSC disputes in the construction industry through machine learning was developed based on the identified factors in the first step. The framework under this step incorporates analysis of different machine learning methodologies including support vector machines (SVM), Naive Bayes (NB), and rule induction classifiers like Decision Trees (DT), Boosted Decision Trees (AD Tree), and PART. Ten machine learning models were developed using these machine learning methodologies to evaluate the best methodology for predicting litigation outcomes. The analysis of all developed models showed that the SVM Kernel Polynomial 3rd degree model has the best performance. This model attained an overall prediction accuracy of 98%.;Third, an automated significant legal factors extraction model for DSC disputes in the construction industry through machine learning was developed. The framework under this step (1) developed 24 machine learning models in which 4 weighting schemes namely Term Frequency (tf), Logarithmic Term Frequency (ltf), Augmented Term Frequency (atf), and Term Frequency Inverse Document Frequency (tf.idf) were implemented for each type of classifier; and (2) developed two C++ algorithms for the preparation of the corpus and implementation of the required weighting mechanisms. The highest prediction rate of 84% was attained by NB classifier while implementing tf.idf weighting. The model was further validated by testing newly un-encountered cases, and a prediction precision of 81.8% was attained.;Finally, the fourth step of the methodology developed an automated machine learning model for the retrieval of supporting DSC precedent cases from large corpi. This step, therefore, (1) implemented Latent Semantic Analysis algorithm; and (2) developed 9 reduced feature spaces with feature sizes of 5, 10, 15, 20, 100, 200, 300, 400, and 500 for analysis and validation of the implemented algorithm. Among the findings of this step are (1) low dimension reduced feature spaces are more representative of documents closely related to the domain problem; (2) high dimension reduced feature spaces, are more representative to domain problems modeling dispersed and unrelated document collections; and (3) LSA reduced feature space of 10 features is the best reduced feature space to adopt for automating the extraction of similar DSC cases from a large corpus.;The main research developments of this research contribute to the advancement of the current state of the art in construction legal decision support and Knowledge Management (KM) in the construction legal domain by developing much needed systems for (1) litigation outcomes prediction; (2) automated legal factor extraction; and (3) automated precedent case retrieval. Those developments hold promises to decrease the costs of legal experts in the construction industry by decreasing time spent on non-value adding tasks such as documents analysis, and offering initial estimates of the legal situation of a disputing party; (2) decrease the time consumed in the litigation processes; (3) facilitate access to legal knowledge needed by practitioners in the construction industry; (4) provide a better understanding of the legal consequences of decision making in the construction industry; and (5) provide solid support documents and probabilistic measures about the strength of a legal situation of a disputing party for better decision making about resolution mechanisms. All these expected outcomes have promising potential to decrease the negative impact of disputes on the construction industry, and thereby creating significant opportunities for the growth of this important sector of the US economy.
机译:建筑业是对国家经济做出重大贡献的行业之一。据估计,世界建筑市场已经到达美国;除非私人建筑合同要求其他解决机制,否则建筑纠纷最终将在法院解决。实际上,建筑行业中的一些人更喜欢诉讼。但是,他们的选择付出了巨大的代价。尽管诉讼具有众多优势,其中包括最正式,最具有约束力的过程,但它有两个主要缺点,这使该过程不受欢迎,也不利于建筑行业的发展。首先,根据司法管辖区的不同,复杂的建筑纠纷可能要花两到六年才能审理。其次,由于需要具有广泛法律知识和建筑经验的专业人员,因此冗长,详细的事实发现程序使诉讼非常昂贵,这是业内无法广泛使用的综合技能。为了克服这些影响建筑业进步和对国家经济贡献的主要弊端,需要法律决策支持系统来有效,有效地减轻这些缺陷,从而更好地控制和管理建筑项目。争端的产生可归因于多种原因,包括:变更单,升级,现场条件不同等。这些原因中的每一个都导致解决和处理争端的单独方法,因此,每个原因都可以是被视为另一种争议类型。在这些类型中,最重要且经常发生的纠纷之一是“场地差异条件”(DSC),这是承包商所遇到的条件与业主预期或描述的条件大不相同的结果。由于此类争议有可能使建设项目偏离计划的时间和成本,因此值得特别关注。;人工智能(AI)领域的许多研究人员已经开发出了用于建模司法推理和预测结果的工具和方法。试图提供上述决策支持功能的建筑诉讼案件。尽管这些系统对提高建筑中的法律决策支持能力做出了重大贡献,但它们的成功是有限的,因为它们不是基于对管辖诉讼结果的法律概念的详细分析。因此,本论文的目的是提供通过统计建模和机器学习为施工现场不同情况(DSC)争议提供法律支持的连贯和综合方法。为了实现这一目标,当前的研究设计并实施了一个四步方法,针对以下目标:(1)提取一套全面的法律因素来管理建筑行业DSC诉讼结果; (2)根据提取的一系列法律因素,为建筑行业的DSC争议设计诉讼预测模型; (3)创建一种方法,用于从案例文件中自动提取重要的法律因素来管理DSC诉讼结果; (4)开发一个自动检索模型,根据与新引入的相似性从大型语料库中识别出DSC的先例。该方法的4个步骤是逐步执行的,每个步骤都取决于其前任的结果。首先,通过统计模型提取了一套全面的重要DSC诉讼案件判决的重要法律因素。开发了二进制概率模型和Logit选择模型(a)识别每个提取的因素对获胜方的预测的影响; (b)确定对预测模型具有最高意义的因素的最佳组合; (c)进行敏感性分析以优先考虑最重要的法律因素。在这一步骤的主要发现中,(1)一般而言,在联邦政府是争端的当事方的情况下,判决有利于政府(所有者)胜于承包商。 (2)“存在明显的事实,即所遇到的条件导致合同性质和成本发生了变化”,在变量中影响最大,从而导致对所有者有利的判断预测下降,并导致对承包商的偏爱的预测增加了17.77%; (3)“存在明显的事实,即说明书中包含警告,说明合同文件中所载明的DSC存在。”导致对所有者有利的判断预测的增幅最大,达56.56%; (4)Binary Probit和Logit Choice模型的发展提取了与建筑行业DSC争议相关的13个具有统计意义的法律因素的联合集合。这为当前研究方法的其他三个步骤提供了依据。第二,在第一步中,基于确定的因素,通过机器学习为建筑行业DSC争议的自动诉讼预测模型。此步骤下的框架合并了对不同机器学习方法的分析,包括支持向量机(SVM),朴素贝叶斯(NB)和规则归纳分类器,如决策树(DT),增强决策树(AD树)和PART。使用这些机器学习方法开发了十个机器学习模型,以评估预测诉讼结果的最佳方法。对所有已开发模型的分析表明,SVM内核多项式三级模型具有最佳性能。该模型的整体预测准确率达到98%。第三,通过机器学习开发了针对建筑行业DSC争议的自动重要法律因素提取模型。此步骤(1)下的框架开发了24种机器学习模型,其中4种加权方案分别为术语频率(tf),对数术语频率(ltf),增强术语频率(atf)和术语频率逆文档频率(tf.idf)为每种类型的分类器实施; (2)开发了两种C ++算法来准备语料库和实现所需的加权机制。 NB分类器在执行tf.idf加权时达到了84%的最高预测率。通过测试未遇到的新案例进一步验证了该模型,并获得了81.8%的预测精度。最后,该方法的第四步开发了一种自动机器学习模型,用于从大型corpi中检索支持的DSC先例案例。因此,此步骤(1)实现了潜在语义分析算法; (2)开发了9个精简特征空间,其特征尺寸分别为5、10、15、20、100、200、300、400和500,用于分析和验证所实现的算法。该步骤的发现包括:(1)低维约简特征空间更能代表与领域问题密切相关的文档; (2)高维缩减特征空间,更能代表建模分散且不相关的文档集合的领域问题; (3)10个特征的LSA约简特征空间是从大型语料库中自动提取相似DSC案例时采用的最佳约简特征空间。这项研究的主要研究进展为当前状态的发展做出了贡献。通过开发急需的系统来进行建筑法律决策支持和建筑法律领域的知识管理(KM)中的艺术(1)诉讼结果预测; (2)自动提取法律因素; (3)自动先例检索。这些事态发展有望通过减少花在非增值任务上的时间来减少建筑业法律专家的成本,例如文件分析,并提供对争议方法律状况的初步估计; (2)减少诉讼过程中的时间; (3)促进获得建筑业从业人员所需的法律知识; (4)更好地了解建筑业决策的法律后果; (5)提供可靠的支持文件和关于争议方法律状况强度的概率措施,以更好地制定解决机制的决策。所有这些预期成果都有望减少争端对建筑业的负面影响,从而为美国这一重要经济部门的发展创造重大机遇。

著录项

  • 作者

    Mahfouz, Tarek Said.;

  • 作者单位

    Iowa State University.;

  • 授予单位 Iowa State University.;
  • 学科 Engineering Civil.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 351 p.
  • 总页数 351
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 建筑科学;
  • 关键词

  • 入库时间 2022-08-17 11:37:45

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号