公共文化服务平台

共 5 条记录，以下是 1-5

全选清除导出

排序方式：

Improving vertex-frontier based GPU breadth-first search: 2014年; Breadth-first search(BFS) is an important kernel for graph traversal and has been used by many graph processing applications. Extensive studies have been devoted in boosting the performance of BFS. As the most effective solution, GPU-acceleration achieves the state-of-the-art result of 3.3×109 traversed edges per second on a NVIDIA Tesla C2050 GPU. A novel vertex frontier based GPU BFS algorithm is proposed, and its main features are three-fold. Firstly, to obtain a better workload balance for irregular graphs, a virtual-queue task decomposition and mapping strategy is introduced for vertex frontier expanding. Secondly, a global deduplicate detection scheme is proposed to remove reduplicative vertices from vertex frontier effectively. Finally, a GPU-based bottom-up BFS approach is employed to process large frontier. The experimental results demonstrate that the algorithm can achieve 10% improvement over the state-of-the-art method on diverse graphs. Especially, it exhibits 2-3 times speedup on low-diameter and scale-free graphs over the state-of-the-art on a NVIDIA Tesla K20 c GPU, reaching a peak traversal rate of 11.2×109 edges/s.; 杨博卢凯高颖慧徐凯王小平程志权; 关键词：广度优先搜索 GPU TESLA 负载平衡

REA-MAC:A low latency routing-enhanced asynchronous duty-cycle MAC protocol for wireless sensor networks: 2013年; Many energy efficiency asynchronous duty-cycle MAC(media access control) protocols have been proposed in recent years.However,in these protocols,wireless sensor nodes almost choose their wakeup time randomly during the operational cycle,which results in the packet delivery latency increased significantly on the multiple hops path.To reduce the packet delivery latency on multi-hop path and energy waste of the sender's idle listening,a new low latency routing-enhanced asynchronous duty-cycle MAC protocol was presented,called REA-MAC.In REA-MAC,each sensor node decided when it waked up to send the beacon based on cross-layer routing information.Furthermore,the sender adaptively waked up based on the relationship between the transmission request time and the wakeup time of its next hop node.The simulation results show that REA-MAC reduces delivery latency by 60% compared to RI-MAC and reduces 8.77% power consumption on average.Under heavy traffic,REA-MAC's throughput is 1.48 times of RI-MAC's.; 唐宏伟曹建农孙彩霞卢凯; 关键词：MAC协议低延迟传感器节点介质访问控制

Fast image matching algorithm based on affine invariants: 2014年; Feature-based image matching algorithms play an indispensable role in automatic target recognition(ATR).In this work,a fast image matching algorithm(FIMA)is proposed which utilizes the geometry feature of extended centroid(EC)to build affine invariants.Based on affine invariants of the length ratio of two parallel line segments,FIMA overcomes the invalidation problem of the state-of-the-art algorithms based on affine geometry features,and increases the feature diversity of different targets,thus reducing misjudgment rate during recognizing targets.However,it is found that FIMA suffers from the parallelogram contour problem and the coincidence invalidation.An advanced FIMA is designed to cope with these problems.Experiments prove that the proposed algorithms have better robustness for Gaussian noise,gray-scale change,contrast change,illumination and small three-dimensional rotation.Compared with the latest fast image matching algorithms based on geometry features,FIMA reaches the speedup of approximate 1.75 times.Thus,FIMA would be more suitable for actual ATR applications.; 张毅卢凯高颖慧; 关键词：图像匹配算法仿射不变量自动目标识别平行四边形

Aware conflict detection of non-uniform memory access system and prevention for transactional memory被引量：3: 2012年; Most transactional memory (TM) research focused on multi-core processors, and others investigated at the clusters, leaving the area of non-uniform memory access (NUMA) system unexplored. The existing TM implementations made significant performance degradation on NUMA system because they ignored the slower remote memory access. To solve this problem, a latency-based conflict detection and a forecasting-based conflict prevention method were proposed. Using these techniques, the NUMA aware TM system was presented. By reducing the remote memory access and the abort rate of transaction, the experiment results show that the NUMA aware strategies present good practical TM performance on NUMA system.; 王睿伯卢凯卢锡城; 关键词：内存访问事务性 NUMA 多核心处理器 MEMORY

GPU acceleration of subgraph isomorphism search in large scale graph被引量：1: 2015年; A novel framework for parallel subgraph isomorphism on GPUs is proposed, named GPUSI, which consists of GPU region exploration and GPU subgraph matching. The GPUSI iteratively enumerates subgraph instances and solves the subgraph isomorphism in a divide-and-conquer fashion. The framework completely relies on the graph traversal, and avoids the explicit join operation. Moreover, in order to improve its performance, a task-queue based method and the virtual-CSR graph structure are used to balance the workload among warps, and warp-centric programming model is used to balance the workload among threads in a warp. The prototype of GPUSI is implemented, and comprehensive experiments of various graph isomorphism operations are carried on diverse large graphs. The experiments clearly demonstrate that GPUSI has good scalability and can achieve speed-up of 1.4–2.6 compared to the state-of-the-art solutions.; 杨博卢凯高颖慧王小平徐凯; 关键词：GPU 图同构搜索区域勘探

全选清除导出

共1页<1>

国家自然科学基金(61170261)