A Binary Feature Extraction Based Data Provenance System Implemented on Flink Platform

机译：基于二进制特征提取在Flink平台上实现的数据出处系统

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data protection and the control of information flow are basic requirements for the security operation of enterprises or organizations. The data provenance of documents is a function that records the transmission of a specific document and provenance afterwards. As an important function of enterprise information security control, it has been confronted with the trouble of high management costs. Therefore, this paper attempts to recover the document content by proactively monitoring the internal traffic data of the enterprise and restore the document and find the parent document accurately through the proposed algorithm, thereby getting rid of the shackle of traditional document tracing. In order to ensure the flexibility and scalability of the streaming data restoration, this paper tries to build algorithm modules based on Flink, a streaming process platform, by migrating key computing services to its platform. In the process, the capture agent is set at the key node to collect traffic data, which is put into the stream processing system through the message queue. The stream processing system restores the file using document restoration algorithm, and finally the file is handed over to the feature extraction module. After the feature extraction module completes the file analysis, it is stored on file systems or structed data storage systems and waits for document tracking requests. The entire system solution achieved above and the daily business of the enterprise are completely seperated, while the load on the internal network flow is also very small. On the other hand, relying on the advantages of Flink's excellent distributed features, the experiments show that the data provenance results are satisfactory.

机译：数据保护和信息流的控制是企业或组织安全运营的基本要求。文档的数据出处是一种函数，记录特定文件的传输并之后的出处。作为企业信息安全控制的重要功能，它已面临高管理费用的麻烦。因此，本文试图通过主动监控企业的内部流量数据并通过所提出的算法准确地查找文档并找到父文档的恢复文档内容，从而摆脱传统文档追踪的钩形。为了确保流数据恢复的灵活性和可扩展性，本文试图通过将关键计算服务迁移到其平台，基于Flink，流过程平台构建算法模块。在该过程中，捕获代理被设置为密钥节点以通过消息队列将流量数据收集到流处理系统。流处理系统使用文档恢复算法恢复文件，最后将文件交给特征提取模块。在特征提取模块完成文件分析之后，它存储在文件系统或结构化数据存储系统上，并等待文档跟踪请求。完全分开了上面实现的整个系统解决方案和企业的日常业务，而内部网络流量的负载也非常小。另一方面，依靠Flink优异的分布特征的优点，实验表明，数据出处结果是令人满意的。

著录项

来源
《International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery》|2018年|1 v.|共8页
会议地点
作者
Yangyizhou Wang; Lan Li; Lei Fan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Feature extraction; Distributed databases; Big Data; Real-time systems; Task analysis; Storms; Companies;

机译：特征提取;分布式数据库;大数据;实时系统;任务分析;风暴;公司;

相似文献

外文文献
中文文献
专利

1. Organotitania-Based Nanostructures as a Suitable Platform for the Implementation of Binary, Ternary, and Fuzzy Logic Systems [J] . Blachecki Andrzej, Mech-Piskorz Justyna, Gajewska Marta, Chemphyschem: A European journal of chemical physics and physical chemistry . 2017,第13期

机译：基于有机钛菊酯的纳米结构作为实施二进制，三元和模糊逻辑系统的合适平台
2. Binary coding based feature extraction in remote sensing high dimensional data [J] . Imani Maryam, Ghassemian Hassan Information Sciences: An International Journal . 2016,第Null期

机译：遥感高维数据中基于二进制编码的特征提取
3. RandPro- A practical implementation of random projection-based feature extraction for high dimensional multivariate data analysis in R [J] . R. Siddharth, G. Aghila SoftwareX . 2020,第2期

机译：RANDPRO-基于随机投影的特征提取的实际实现，用于R的高维多变量数据分析
4. A Binary Feature Extraction Based Data Provenance System Implemented on Flink Platform [C] . Yangyizhou Wang, Lan Li, Lei Fan International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery . 2018

机译：基于Flink平台的基于二元特征提取的数据来源系统
5. A Scale Space Local Binary Pattern (SSLBP) - Based Feature Extraction Framework to Detect Bones from Knee MRI Scans [D] . Mun, Jinyeong 2018

机译：基于尺度空间局部二进制模式（SSLBP）的特征提取框架，可从膝盖MRI扫描中检测骨骼
6. Sequoia: an interactive visual analytics platform for interpretation and feature extraction from nanopore sequencing datasets [O] . Ratanond Koonchanok, Swapna Vidhur Daulatabad, Quoseena Mir, 2021

机译：红杉：来自纳米孔测序数据集的解释和特征提取的交互式视觉分析平台
7. ABSTRACT Various body parts or organs can be analysed to identify the different diseases in the human body. Fingernail analysis is one of the ways to identify disease in the human body. Nails are the body part which are farthest from the heart and therefore receive oxygen at last. As a result the nails are the first who show the symptoms of a disease in the human body. Fingernails can be easily captured for diagnosis and there are no heavy equipment or no specific conditions required to use nail image for disease diagnosis, like in other tests and scanning processes. Human nails deliver beneficial information about complaints or any nutritive imbalances in the human body depending upon their shape, texture and colour. In human beings, numerous systemic and skin diseases can be easily analyzed through careful examination of nails of both the limbs. A lot of nail illnesses have been found to be primary signs of numerous underlying systemic illnesses. The colour, texture or shape changes in nails are signs of many diseases mainly affecting nails. Considering all these properties of nails a system is proposed that uses digital image processing (DIP) methods for identifying such changes in the human nail to get more precise results and predict numerous diseases effortlessly. With the emerging Internet of Things (IOT) concept the generated report is made available remotely, this will help users to reduce transportation efforts. As the system has to deal with large and private data, the security of data must be ensured. To keep the data confidential, the Blockchain concept which is one of the most emerging concepts in the field of data management is used. The paper contains the implementation of the digital image processing for feature extraction of nail images, usage of IOT (ThingSpeak cloud) for data storage and implementation of Blockchain to keep the system secured and theft free. KEY WORDS: Int ernet of thin gs (IOT), Image proc essin g, Thin gSpeak, RG B vavalues, Mean pi xel vavalues, Bloc kchain , Hash key. Disease Diagnostic System: Abnormalities in Human Nail [O] . Pranav S. Wazarkar 2020

机译：摘要的各个身体部位或器官可被分析以识别在人体内的不同的疾病。指甲分析来识别人体疾病的方法之一。指甲是身体一部分是离心脏最远，因此在最后接受氧气。作为结果，指甲是第一谁表现出人体疾病的症状。指甲可以容易地捕获用于诊断和没有重装或需要使用指甲图像用于疾病诊断，比如在其他测试和扫描过程没有特定的条件。人的指甲提供有关投诉或取决于它们的形状，纹理和色彩在人体内的任何营养失衡有益的信息。在人类中，许多全身性皮肤疾病是可以很容易地通过两个四肢指甲的仔细检查分析。很多指甲病已发现众多潜在系统性疾病的主要症状。在指甲的颜色，质地和形状的变化是许多疾病主要影响指甲的迹象。考虑到所有的指甲的这些性能的系统被提出，用于识别人指甲这样的变化以获得更精确的结果，并毫不费力预测许多疾病用途的数字图像处理（DIP）方法。随着物联网（IOT）的概念，新兴的互联网将生成的报告提供远程，这将帮助用户降低运输工作。由于系统必须处理大量的私人数据，数据的安全性必须得到保证。为了保持数据的机密性，使用Blockchain的概念，它是在数据管理领域的大多数新兴的概念之一。本文包含了数字图像处理的指甲图像，IOT（ThingSpeak云）的使用为数据存储和执行Blockchain的特征提取的执行，以保持固定的系统和盗窃免费。关键词：诠释薄GS（IOT），图像的ERNET PROC essin克，薄型gSpeak，RG乙vavalues，平均数PI XEL vavalues，阵营kchain，哈希密钥。疾病诊断系统：在人类指甲异常

A Binary Feature Extraction Based Data Provenance System Implemented on Flink Platform

摘要

著录项

相似文献

相关主题

期刊订阅