Improving particle filter performance using SSE instructions

IROS (2009)

Abstract
Robotics researchers are often faced with real-time constraints, and for that reason algorithmic and implementation-level optimization can dramatically increase the overall performance of a robot. In this paper we illustrate how a substantial run-time gain can be achieved by taking advantage of the extended instruction sets found in modern processors, in particular the SSE1 and SSE2 instruction sets. We present an SSE version of Monte Carlo Localization that results in an impressive 9x speedup over an optimized scalar implementation. In the process, we discuss SSE implementations of atan, atan2 and exp that achieve up to a 4x speedup in these mathematical operations alone.
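To make concrete the kind of data parallelism the abstract refers to, here is a minimal sketch, not the paper's implementation, of processing four particles per instruction with SSE1 intrinsics. It assumes a hypothetical structure-of-arrays particle layout (separate `x` and `y` float arrays) and computes each particle's squared distance to a landmark, a typical building block of a localization sensor model. The function name `sq_dist_sse` and its parameters are illustrative assumptions.

```c
#include <stddef.h>
#include <xmmintrin.h>  /* SSE1 intrinsics: __m128 and the _mm_*_ps family */

/* Hypothetical helper (not from the paper): for n particles stored as
 * separate x[] and y[] arrays, write the squared distance of each particle
 * to the landmark (lx, ly) into out[]. Four particles are handled per
 * loop iteration; any leftover particles fall back to scalar code. */
void sq_dist_sse(const float *x, const float *y,
                 float lx, float ly, float *out, size_t n)
{
    const __m128 vlx = _mm_set1_ps(lx);  /* broadcast lx to all 4 lanes */
    const __m128 vly = _mm_set1_ps(ly);  /* broadcast ly to all 4 lanes */
    size_t i = 0;

    for (; i + 4 <= n; i += 4) {
        /* Unaligned loads so the caller need not 16-byte-align the arrays. */
        __m128 dx = _mm_sub_ps(_mm_loadu_ps(x + i), vlx);
        __m128 dy = _mm_sub_ps(_mm_loadu_ps(y + i), vly);
        __m128 d2 = _mm_add_ps(_mm_mul_ps(dx, dx), _mm_mul_ps(dy, dy));
        _mm_storeu_ps(out + i, d2);
    }

    /* Scalar tail for the last n % 4 particles. */
    for (; i < n; ++i) {
        float dx = x[i] - lx;
        float dy = y[i] - ly;
        out[i] = dx * dx + dy * dy;
    }
}
```

The structure-of-arrays layout is what lets one load fetch the same coordinate of four consecutive particles; an array of per-particle structs would first need to be shuffled into this form. If the arrays are known to be 16-byte aligned, `_mm_load_ps` and `_mm_store_ps` could replace the unaligned variants.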
Keywords
sse version,sse instruction,optimized scalar implementation,sse2 instruction set,sse implementation,robotics researcher,implementation-level optimization,modern processor,extended instruction,improving particle filter performance,monte carlo localization,mathematical operation,data mining,real time,robots,monte carlo methods,particle filter,optimization,instruction sets