您的位置: 专家智库 > >

国家自然科学基金(61170261)

作品数:5 被引量:4H指数:1
发文基金:国家自然科学基金国家高技术研究发展计划更多>>
相关领域:自动化与计算机技术轻工技术与工程电子电信更多>>

文献类型

  • 5篇中文期刊文章

领域

  • 5篇自动化与计算...
  • 1篇电子电信
  • 1篇轻工技术与工...

主题

  • 2篇搜索
  • 2篇GPU
  • 1篇低延迟
  • 1篇动目标
  • 1篇多核
  • 1篇多核心
  • 1篇多核心处理器
  • 1篇优先搜索
  • 1篇事务
  • 1篇事务性
  • 1篇四边形
  • 1篇同构
  • 1篇图同构
  • 1篇图像
  • 1篇图像匹配
  • 1篇图像匹配算法
  • 1篇平行四边形
  • 1篇区域勘探
  • 1篇自动目标识别
  • 1篇目标识别

传媒

  • 5篇Journa...

年份

  • 1篇2015
  • 2篇2014
  • 1篇2013
  • 1篇2012
5 条 记 录,以下是 1-5
排序方式:
Improving vertex-frontier based GPU breadth-first search
2014年
Breadth-first search(BFS) is an important kernel for graph traversal and has been used by many graph processing applications. Extensive studies have been devoted in boosting the performance of BFS. As the most effective solution, GPU-acceleration achieves the state-of-the-art result of 3.3×109 traversed edges per second on a NVIDIA Tesla C2050 GPU. A novel vertex frontier based GPU BFS algorithm is proposed, and its main features are three-fold. Firstly, to obtain a better workload balance for irregular graphs, a virtual-queue task decomposition and mapping strategy is introduced for vertex frontier expanding. Secondly, a global deduplicate detection scheme is proposed to remove reduplicative vertices from vertex frontier effectively. Finally, a GPU-based bottom-up BFS approach is employed to process large frontier. The experimental results demonstrate that the algorithm can achieve 10% improvement over the state-of-the-art method on diverse graphs. Especially, it exhibits 2-3 times speedup on low-diameter and scale-free graphs over the state-of-the-art on a NVIDIA Tesla K20 c GPU, reaching a peak traversal rate of 11.2×109 edges/s.
杨博卢凯高颖慧徐凯王小平程志权
关键词:广度优先搜索GPUTESLA负载平衡
REA-MAC:A low latency routing-enhanced asynchronous duty-cycle MAC protocol for wireless sensor networks
2013年
Many energy efficiency asynchronous duty-cycle MAC(media access control) protocols have been proposed in recent years.However,in these protocols,wireless sensor nodes almost choose their wakeup time randomly during the operational cycle,which results in the packet delivery latency increased significantly on the multiple hops path.To reduce the packet delivery latency on multi-hop path and energy waste of the sender's idle listening,a new low latency routing-enhanced asynchronous duty-cycle MAC protocol was presented,called REA-MAC.In REA-MAC,each sensor node decided when it waked up to send the beacon based on cross-layer routing information.Furthermore,the sender adaptively waked up based on the relationship between the transmission request time and the wakeup time of its next hop node.The simulation results show that REA-MAC reduces delivery latency by 60% compared to RI-MAC and reduces 8.77% power consumption on average.Under heavy traffic,REA-MAC's throughput is 1.48 times of RI-MAC's.
唐宏伟曹建农孙彩霞卢凯
关键词:MAC协议低延迟传感器节点介质访问控制
Fast image matching algorithm based on affine invariants
2014年
Feature-based image matching algorithms play an indispensable role in automatic target recognition(ATR).In this work,a fast image matching algorithm(FIMA)is proposed which utilizes the geometry feature of extended centroid(EC)to build affine invariants.Based on affine invariants of the length ratio of two parallel line segments,FIMA overcomes the invalidation problem of the state-of-the-art algorithms based on affine geometry features,and increases the feature diversity of different targets,thus reducing misjudgment rate during recognizing targets.However,it is found that FIMA suffers from the parallelogram contour problem and the coincidence invalidation.An advanced FIMA is designed to cope with these problems.Experiments prove that the proposed algorithms have better robustness for Gaussian noise,gray-scale change,contrast change,illumination and small three-dimensional rotation.Compared with the latest fast image matching algorithms based on geometry features,FIMA reaches the speedup of approximate 1.75 times.Thus,FIMA would be more suitable for actual ATR applications.
张毅卢凯高颖慧
关键词:图像匹配算法仿射不变量自动目标识别平行四边形
Aware conflict detection of non-uniform memory access system and prevention for transactional memory被引量:3
2012年
Most transactional memory (TM) research focused on multi-core processors, and others investigated at the clusters, leaving the area of non-uniform memory access (NUMA) system unexplored. The existing TM implementations made significant performance degradation on NUMA system because they ignored the slower remote memory access. To solve this problem, a latency-based conflict detection and a forecasting-based conflict prevention method were proposed. Using these techniques, the NUMA aware TM system was presented. By reducing the remote memory access and the abort rate of transaction, the experiment results show that the NUMA aware strategies present good practical TM performance on NUMA system.
王睿伯卢凯卢锡城
关键词:内存访问事务性NUMA多核心处理器MEMORY
GPU acceleration of subgraph isomorphism search in large scale graph被引量:1
2015年
A novel framework for parallel subgraph isomorphism on GPUs is proposed, named GPUSI, which consists of GPU region exploration and GPU subgraph matching. The GPUSI iteratively enumerates subgraph instances and solves the subgraph isomorphism in a divide-and-conquer fashion. The framework completely relies on the graph traversal, and avoids the explicit join operation. Moreover, in order to improve its performance, a task-queue based method and the virtual-CSR graph structure are used to balance the workload among warps, and warp-centric programming model is used to balance the workload among threads in a warp. The prototype of GPUSI is implemented, and comprehensive experiments of various graph isomorphism operations are carried on diverse large graphs. The experiments clearly demonstrate that GPUSI has good scalability and can achieve speed-up of 1.4–2.6 compared to the state-of-the-art solutions.
杨博卢凯高颖慧王小平徐凯
关键词:GPU图同构搜索区域勘探
共1页<1>
聚类工具0