【极简笔记】LineNet: a Zoomable CNN for Crowdsourced High Definition Maps Modeling in Urban Environments

LineNet: a Zoomable CNN for Crowdsourced High Definition Maps Modeling in Urban Environments

Abstract:

We proposed a convolutional neural network with a novel prediction layer and a zoom(缩放) module, called LineNet. It is designed for state-of-the-art lane detection in an unordered crowdsourced image dataset(无序的众包图像数据集).

And we introduced TTLane, a dataset for efficient lane detection in urban road modeling applications.

Combining LineNet and TTLane, we proposed a pipeline to model HD maps with crowdsourced data for the first time. And the maps can be constructed precisely even with inaccurate crowdsourced data.

 I. INTRODUCTION

高清地图建模思路:

1.采用crowdsourced way(众包)的形式收集数据,例如行车记录仪;

2.用LineNet识别标记lane markings;

3.model HD maps using structure from motion (SfM) with the detection results

To achieve high precision, a novel Line Prediction(LP) layer is included to locate lane markings directly instead of calculating from segmentation.采用直接定位车道标志而不是图像分割的计算方法

In addition to the LP layer, the Zoom Module is added to recognize occlusion segments and gaps inside dashed lanes.缩放模块用来识别虚线内的遮挡段和间隙,

With this module, the field of view (FoV) can be enlarged to arbitrary size without changing the network architecture.使用该模块,视场(FoV)可以放大到任意大小而不需要改变网络架构

IV. LINENET

A. Line Prediction Layer

The Line Prediction (LP) layer is designed for accurate lane positioning and classification, inspired by Zhu et al.[32]’s three branch prediction procedure. There are six branches in our LP layer: line mask; line type; line position; line direction; line confidence; and line distance, as is shown in Fig 2.

  • line mask:is a stroke we draw with a fixed width (32 pixels in our experiments).
  • line type:indicates one of the six-lane marking types(WS(WHITE SOLID), WD(WHITE DASH), RB(ROAD BOUNDARIES), YS(YELLOW SOLID), YD(YELLOW DASH),others)
  • line position:Line position predicts the vector from an anchor point to the closest point in the line. Supervising on the line position can produce much more accurate results than using a mask.线位置预测从anchor 到线中最近点的矢量。 对线位置进行监控可以比使用mask产生更准确的结果。
  • Line direction:predicts the orientation of the lane
  • Line confidence:predicts the confidence ratio, i.e., whether the network can see the lane clear enough, which is defined between 0 (if two lines are closer than 46 pixels) and 1 (otherwise).When we zoom into two adjacent lines, they will gradually separates, result in change of the confidence ratio.
  • Line distance:the distance from the anchor point to the closest point in the line

B. Zoom Module

The thumbnail CNN provides a global context for the features that the high-resolution CNN “sees” in detail. 

两个cnn通过“injection layer”在一定程度上权重共享。

For the zoom module, we use the 66th layer (4b13) of the Dilated Resnet architecture [5] (Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs)as the injection layer,

The Zoom Module and LP layer work together to boost performance. The zoom module allows the network to zoom into areas where the LP layer is not confident enough.

In practice, the zooming procedure is used multiple times, with the zoom ratio gradually multiplied from 0.5 to 16.Fig 5 shows the four stages of the zooming process and how the LP layer and the zoom module interact with each other through the output of the line confidence branch. Overall, when the certainty of the lane rises, the spatial location of the lane becomes more accurate.

C. Post Processing

The clustering algorithm named DBSCAN[7] was used with our hierarchical distance(HDis).

V. EXPERIMENTS

A. Lane Detection for HD Maps Modeling

2) LineNet for Lane Positioning: 

In the lane positioning subtask, our goal is to predict all the lane markings in the image without considering their types.

3) LineNet for Comprehensive Lane Detection:

In this subtask, we need to predict both lanes marking positions and their types.here we evaluate area and distance between lines, not masks.

pdLine预测准确满足的标准:

  • The distance between the endpoints of two lines is smaller than a certain threshold (annotated as disThresh).两条线的端点之间的距离小于某个阈值(注释为disThresh)。
  • The area enclosed by two lines is smaller than a certain threshold (annotated as areaThresh).由两条线包围的区域小于某个阈值(注释为areaThresh)。

In our experiments, disThresh is set to 40px due to the high-resolution of images and areaThresh is set to disThresh times the length of gtLine.

 

Visualization

SCNN[30] and MLD-CRF[12] do not produce classes information so the classes with all methods were evaluated except these two.

By contrast, LineNet was robust and performed well in both situations.

全部评论

相关推荐

hanliu:1. 排版与格式问题字体与对齐问题:标题和内容的字体大小差异不够明显,无法迅速吸引目光。某些文字看起来有些拥挤(比如校园经历中的“班委成员”部分)。2. 内容逻辑性模块顺序问题:实习经历放在较靠后的位置,实际上这部分内容对应聘来说更重要,建议提前突出。细节表述不够突出:比如教育背景部分的专业课程仅仅列出名字,没有说明自己在这些课程中表现如何或者掌握了什么技能,缺乏量化描述。多余内容:例如“班委成员”和“宣传委员”这类校园经历,叙述过于普通,缺乏和岗位相关的实质性贡献。,建议简写。3. 措辞专业性表达不够精准:例如“协助班长与团支书更好地为同学服务”显得较为笼统,没有实际成果的体现。用词重复:如“学习了焊接”“学习了光检”等重复词语较多,缺乏丰富的动词来展示个人能力(如“负责”“优化”“改进”等)。技能展示不足:虽然列出了UG和CAD证书,但没有明确提到这些技能如何在实际工作中发挥作用。4. 技能匹配度技能深度不足:虽然列出了掌握的软件和技术,但没有描述技能水平(如“熟练掌握”“精通”),也没有具体案例支持这些技能。缺乏岗位导向性:比如针对机械设计与制造方向,实习经历提到了“E6尾灯项目”,但没有详细说明自己在其中的技术贡献,可能会显得经验描述泛泛而谈。5. 自我评价问题表达空泛:如“具有良好的沟通协调能力”“责任心强”之类的描述太常见,没有让人眼前一亮的特点。缺乏成果支持:自我评价中的能力没有用具体项目、经历或成就来验证,可信度较弱。 兄弟加油
点赞 评论 收藏
分享
2024-12-11 11:40
海南大学 Java
点赞 评论 收藏
分享
评论
点赞
收藏
分享

创作者周榜

更多
牛客网
牛客企业服务