Data Stream Processing Research at IMC of East China Normal University

Aoying ZHOU; Cheqing JIN; Weining QIAN

首页> 外文期刊>電子情報通信学会技術研究報告 >Data Stream Processing Research at IMC of East China Normal University

【24h】

Data Stream Processing Research at IMC of East China Normal University

机译：华东师范大学IMC数据流处理研究

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data stream processing has been attracting more and more attention in research and industry communities due to its broad potential applications. In this talk, we would like to introduce briefly the research work which have been done in our group. Our research interests on data streams are frequent item(set)s mining, clustering, and burst detection over data streams. Some work on practical application and some consideration on future work will be introduced as well.For the basic problem of mining frequent items over data streams, an algorithm, called hCount is proposed. It is of low space complexity, low per-tuple processing cost, and high recall and precision. Then, for mining of the frequent itemsets, we develop a new false-negative frequent itemset mining algorithm which can get a condensed representation of frequent itemsets in transactional data streams by discovering a false negative collection of some special itemsets that covers frequent itemsets with high probability with respect to set inclusion relationship among itemsets.Our research on data stream mining was focusing on clustering of data streams. SWClustering is the algorithm we proposed to cluster data streams over sliding windows, and EHCF (Exponential Histogram of Cluster Features) is the synopsis to maintain the statistic information of clusters in sliding windows. With SWClustering, not only the changing distribution of clusters but also the evolving behaviors of individual clusters could be captured. CluDistream is for clustering distributed data streams, which can effectively handle a huge volume of data with noisy, corrupted or incomplete data records generated in distributed enviornment. In CluDistream, the EM-based (Expectation Maximization) algorithms, each data record is assigned to a cluster with certain degree of membership.The other important piece of work is on burst detection or monitoring over data streams. The fractal analysis method is adapted to enable the monitoring of both monotonic and non-monotonic aggregates on time changing data stream. The monotony property of aggregate monitoring is revealed and monotonic search space is built to decrease the time overhead for detecting bursts from O(m) to O(log m), where m is the number of windows to be monitored. With the help of a novel piecewise fractal model, the statistical summary is compressed to be fit in limited main memory, so that high aggregates on windows of any length can be detected accurately and efficiently on-line.A practical data stream processing system for telecommunication network flow data analysis will be also introduced in this talk.

机译：数据流处理由于其广泛的潜在应用而在研究和行业界引起了越来越多的关注。在本次演讲中，我们想简单介绍一下我们小组所做的研究工作。我们对数据流的研究兴趣是对数据流的频繁项集挖掘，聚类和突发检测。针对实际应用中的一些工作，以及对未来工作的一些考虑。针对挖掘数据流中频繁项的基本问题，提出了一种称为hCount的算法。它具有较低的空间复杂度，较低的每组处理成本以及较高的查全率和精度。然后，为了挖掘频繁项集，我们开发了一种新的假阴性频繁项集挖掘算法，该算法可以通过发现某些特殊项集的假阴性集合来掩盖交易项数据流中频繁项集的浓缩表示，这些特殊项集极有可能覆盖频繁项集我们在数据流挖掘方面的研究集中在数据流的聚类上。 SWClustering是我们提出的在滑动窗口上对数据流进行聚类的算法，EHCF（聚类特征的指数直方图）是在滑动窗口中维护聚类统计信息的概要。使用SWClustering，不仅可以捕获群集的变化分布，而且可以捕获单个群集的演化行为。 CluDistream用于群集分布式数据流，它可以有效处理大量数据，其中包含在分布式环境中生成的嘈杂，损坏或不完整的数据记录。在基于EM的（期望最大化）算法CluDistream中，每个数据记录都分配给具有一定隶属度的集群。另一项重要的工作是突发检测或监视数据流。分形分析方法适用于在时变数据流上监视单调和非单调聚合。揭示了聚合监视的单调性，并构建了单调搜索空间以减少用于检测从O（m）到O（log m）的突发的时间开销，其中m是要监视的窗口数。借助新颖的分段分形模型，统计摘要被压缩以适合有限的主内存，从而可以在线上准确，高效地检测到任何长度的窗口上的高聚集量。一种实用的电信数据流处理系统本讲座还将介绍网络流量数据分析。

著录项

来源
《電子情報通信学会技術研究報告》 |2008年第211期|p.39-40|共2页
作者
Aoying ZHOU; Cheqing JIN; Weining QIAN;
展开▼
作者单位

Institute of Massive ComputingEast China Normal University,Shanghai 200062, China;

Institute of Massive ComputingEast China Normal University,Shanghai 200062, China;

Institute of Massive ComputingEast China Normal University,Shanghai 200062, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
data stream processing; frequent item; clustering; burst detection;

机译：数据流处理;频繁的项目;集群突发检测;
入库时间 2022-08-18 00:37:40

相似文献

外文文献
中文文献
专利

1. Researchers at East China Normal University Publish New Data on Expert Systems [J] . Energy Business Journal . 2011,第jana3aocta31期

机译：华东师范大学研究人员发布专家系统新数据
2. New polycarbonate line in eastern China on stream/Phosgene-free processing technology from EPC Group [J] . Plastics Information Europe Group plastics information europe . 2015,第935期

机译：EPC集团在华东地区的新型聚碳酸酯生产线采用无流/无光处理技术
3. A Normal I/O Order Radix-2 FFT Architecture to Process Twin Data Streams for MIMO [J] . Antony Xavier Glittas, Mathini Sellathurai, Gopalakrishnan Lakshminarayanan IEEE transactions on very large scale integration (VLSI) systems . 2016,第6期

机译：常规I / O阶Radix-2 FFT架构可处理MIMO的双数据流
4. SOME RESEARCH ACCOMPLISHMENTS ON EM THEORY AT EAST CHINA NORMAL UNIVERSITY [C] . Japan-China Joint Meeting on Optical Fiber Science and Electromagnetic Theory . 1990

机译：华东师范大学EM理论研究成果
5. "Continuing a Normal Life as a Normal Person": A Hermeneutic Phenomenological Study on the Reconstruction of Self Identity of Chinese Women Within the Lived Experience of Breast Cancer Survivorship. [D] . Cheng, Terry Tien. 2010

机译：“以正常人的身份继续正常生活”：关于在乳腺癌生存经验中重建中国女性自我认同的诠释学现象学研究。
6. Mapping and Evaluating the Urbanization Process in Northeast China Using DMSP/OLS Nighttime Light Data [O] . Kunpeng Yi, Hiroshi Tani, Qiang Li, 2014

机译：利用DMSP / OLS夜间光数据绘制和评估东北地区的城市化进程
7. Decision-making is the process of analyzing information about a problem situation and comparing it to a specific conclusion in order to solve a specific problematic (Yıkılmaz, 2001; Miller and Byrnes, 2001). Decision-making styles are a mechanism that is influenced by the internal and external conditions that determine the direction of the decisions of the individual, the content of the decision-making process, and the outcome of the decision-making process (Payne, Bettman and Johson, 1993; Bavol’ár and Orosová, 2015). ACT is a contemporary member of the Cognitive Behavioral Therapy family. ACT (Acceptance and commitment therapy) has both similar and different directions with Behavioral Therapies and Cognitive Therapies (Herbet and Forman, 2011; Hayes, 2004). KKT responds to classical behavioral treatments using both existential and cognitive approaches in the analysis of behavior. KKT is a science wing that tries to solve human problems with a wider perspective aimed at solving problematic human behaviors (Plumb, Stewart, Dahl and Lundgren, 2009). It is seen that there is very little research about the new approach of ACT approach when the aiming country of our country is screened and it is thought that our country will contribute to the field of psychological counseling with the work done. In the scope of the research, experimental and control groups and preliminary test, post-test and follow-up measurements of 2x3 experimental design were used. The study's study group consists of a total of 24 (12 experimental and 12 control groups) university students studying in different departments and levels, continuing their education in the academic year of 2015-2016 in Ağrı province and İbrahim Chechen University in 2015-2016 academic year. The average age of participants in the experiment and control group is 20. There were 12 participants in the experimental group, 7 female and 5 male, and 12 participants, 7 female and 5 male in the control group. Personal Information Form and Decision Making Style Scale prepared by the researcher were used in the research. In order to decide on the tests to be used in the course of analyzing the data, the scores of the participant's Decision Styles Scale pre-test, which are placed primarily in the experimental and control groups, it was analyzed whether the basic expectations of parametric tests were answered. As a result of the analysis made, the scores, skewness and kurtosis coefficients obtained from the Decision Making Styles Scale were given to the experimental and control groups. It was determined that the distribution was normal in the result of Shapiro-Wilk test, in which the skewness and kurtosis coefficients of each sub-scale were ranked between -1 and +1. Participants in the experimental and control groups; homogeneity test results for decision-style pre-test measurements indicate that the data are homogeneous. According to the results of the Mauchly Globalness Test, it has been determined that working supports the hypothesis. It was determined that there was no significant difference between the pre-test scores obtained from dependent decision-making style of experiment and control groups, but the test group showed lower mean scores at the significant level within the scores of post-test and follow-up tests. Therefore, it can be said that the ACT-oriented psychoeducation program applied to the experimental group reduces the dependent decision-making style scores from the decision style sub-dimensions and the psychoeducation program has a lasting effect. It was determined that there was no significant difference between pre-test, post-test and follow-up scores obtained from the Spontaneous-Instant Decision Style of experiment and control groups. Thus, it can be said that this situation does not cause a significant difference in the Spontaneous-Decision-Making Style scores from the decision style sub-dimensions of the ACT-oriented psychoeducation program applied to the experimental group. The ACT -oriented psychoeducation program had a decline in the intuitive decision-making styles of the individuals, but this decrease did not create significant differences. Thus, it can be said that this situation does not make a meaningful difference in the intuitive decision style scores from the decision style sub-dimensions of the KKT oriented psychoeducation program applied to the experimental group. The pre-test scores obtained from the rational decision-making style of the experimental and control groups showed that there was a difference between the post-test and the follow-up scores, but this difference was not significant. As a result of the analysis, it was determined that the test group had higher levels of rational decision style than the pre - test scores in the post test and follow - up scores, whereas the post test and follow - up test scores in the control group rational decision style showed a decrease compared to the pre - test scores. the pre - test scores. Decision-making Styles Scale Avoidant Decision Making As a result of the analysis of the mean scores of the subscale scores of pre-test, post-test and follow-up measures, the group effect was found to be insignificant. It was determined that the experimental and control groups differed significantly from the pre-test scores obtained from the avoidant decision-making style but did not show any significant change within the scores of the post-test and follow-up tests. [O] . mustafa ercengiz, ali haydar şar 2018

机译：决策是分析有关问题情况的信息并将其与特定结论进行比较的过程，以解决特定的问题（Yıkılmaz，2001; Miller和Byrnes，2001）。决策风格是一种机制，受到内部和外部条件的影响，确定个人决定的方向，决策过程的内容以及决策过程的结果（PAYNE，BETTMAN和BEDNE） Johson，1993;Bavol'ár和奥萨洛瓦，2015）。法案是认知行为治疗家庭的当代成员。行为（验收和承诺治疗）具有与行为疗法和认知疗法的类似和不同方向（Herbet和Forman，2011; Hayes，2004）。 KKT在分析行为中使用存在性和认知方法来响应古典行为治疗方法。 KKT是一个科学翼，试图解决人类问题，旨在解决有问题的人类行为（铅垂，斯图尔特，Dahl和Lundgren，2009）。有人认为，当我们国家的瞄准国家进行筛选时，有关行动方法的新方法几乎没有研究，并且认为我们的国家将为完成工作做出贡献的心理咨询领域。在研究的范围内，使用实验和对照组和初步测试，使用后测试和后续测量的2x3实验设计。该研究的研究小组包括共24名（12个实验和12个对照组）大学学生在不同的部门和水平上学习，在2015-2016学术上继续进行2015-2016的学术年度2015-2016学年年。实验和对照组参与者的平均年龄是20.实验组中有12名参与者，7名女性和5名男性，12名参与者，7名女性和5名男性。研究人员准备的个人信息表格和决策制定风格规模在研究中使用。为了决定在分析数据的过程中使用的测试，参与者的决策风格刻度预测的分数主要在实验和对照组中进行，分析了参数的基本期望吗？测试得到了回答。由于对实验和对照组给出了从决策曲调标度获得的分数，偏移和峰度分子系数。确定该分布是正常的在成熟的-WILK试验结果中，其中每亚级的偏差和刚度系数在-1到+1之间排名。实验和对照组的参与者;决策式预测测量的同质性测试结果表明数据是均匀的。根据毛毛环保试验的结果，已确定工作支持假设。据确定，从依赖决策风格的实验和对照组获得的预测分数之间没有显着差异，但测试组在测试后的分数内显示出较低的平均分数和后续的分数 - 测试。因此，可以说，应用于实验组的面向活动的心理教育程序减少了决策风格子维度的依赖决策风格分数，并且心理教育程序具有持久的效果。据确定，从实验和对照组自发决策风格获得的预测试，测试后和后续评分之间没有显着差异。因此，可以说这种情况不会导致从应用于实验组的活动导向的心理教育程序的决策风格分数的自发决策风格分数显着差异。行为的心理教育计划在个人直观的决策风格下降，但这种减少并没有产生显着的差异。因此，可以说，这种情况不会在从应用于实验组的KKT导向的心理教育程序的决策风格分数中产生有意义的决策风格分数。从实验和对照组的理性决策风格获得的预测分数显示后检验后和随访评分之间存在差异，但这种差异并不重要。由于分析，确定测试组的理性决策风格较高，而不是后测试和后续分数的预测分数，而对照组合理决策风格的后测试和后续测试分数显示出与预测试分数相比的减少。预测分数。决策款式扩展避免决策由于分析了预测试，测试后和后续措施的班次评分的平均分数，发现群体效应是微不足道的。确定实验和对照组从避免决策风格获得的预测评分中显着不同，但在测试后和后续测试的分数内没有显示出任何重大变化。
8. Coal liquefaction process streams characterization and evaluation: High performance liquid chromatography (HPLC) of coal liquefaction process streams using normal-phase separation with uv diode array detection [R] . Clifford, D J, McKinney, DE, Hou, L, 1994

机译：煤液化工艺流程表征和评估：采用正交相分离和uv二极管阵列检测的煤液化工艺流的高效液相色谱（HpLC）

Data Stream Processing Research at IMC of East China Normal University

摘要

著录项

相似文献

相关主题

期刊订阅