Journal of Geospatial Information Technology

fa ترکیب داده‌های لایدار و تصاویر هوایی بر مبنای شبکه‌های عصبی کانولوشن به‌منظور تشخیص مدل ساختمان‌ها Synergistic Use of LiDAR Data and Aerial Image based on Convolutional Neural Networks for Building Model Recognition فتوگرامتری Aerial Photogrammetry كاربردي Applicable ساختمانها یکی از مهمترین سازههای شهری هستند که در کاربردهای مختلف و در نقشهبرداری شهری مورد استفاده قرار میگیرند. در سالهای اخیر، با توسعه تکنولوژی اخذ دادهها با توان تفکیک بالا، روشها و الگوریتمهای مختلفی به منظور استخراج مدلهای دقیق و بهنگام ساختمانها ارائه شده است. در این مقاله، روشی نوین و مدلمبنا به منظور استخراج ساختمانها و شناسایی اتوماتیک مدل سقف آنها از قبیل سقف مسطح، شیروانی، شیبدار و هرمی ارائه شده است که در آن از شبکههای عصبی با معماری عمیق به منظور یادگیری سلسله مراتبی ویژگیهای استخراج شده از دادههای لایدار و تصاویر ارتوفتو استفاده میشود. مهمترین مراحل این روش عبارتند از: آموزش مدل و یادگیری، بخشبندی تصویر، استخراج ویژگی، و برچسب زدن عوارض. کلیه این مراحل در یک ساختار نظارت شده و با استفاده از یک شبکه عصبی کانولوشن که از قبل آموزش دیده شده است، اجرا میشوند تا یک سیستم تشخیص الگوی اتوماتیک برای تشخیص انواع مختلف ساختمانها در یک ناحیه شهری فراهم گردد. در این روش، اطلاعات ارتفاعی، تولیدکنندهی ویژگیهای هندسی پایدار برای شبکه عصبی کانولوشن هستند که در تعیین موقعیت محدوده هر سقف به کار گرفته میشوند. شبکه عصبی کانولوشن یکی از انواع شبکههای عصبی رو به جلو و با مفهوم درک و فهم چندلایهای است که شامل تعدادی لایه کانولوشن و نمونهبرداری میباشد. از آنجایی که در روش پیشنهادی، مجموعه دادهی آموزشی یک کتابخانه کوچک از مدلهای برچسبدار است، لذا زمان محاسباتی برای یادگیری با استفاده از مدلهای از قبل آموزش دیده، به طور قابل توجهی کم و در حدود چند ساعت است. نتایج حاصله، نشاندهنده موثر بودن تلفیق دادههای ارتفاعی و تصاویر رنگی با هم در یادگیری عمیق به منظور استخراج ساختمانها و شناسایی مدل سقف آنها به صورت همزمان است به طوری که خطای حد بالای اول و دقت آموزش مدل حاصل از تلفیق این دو دسته داده به ترتیب حدود 05/0 و 95 درصد است. همچنین، میزان موفقیت و صحت شناسایی ساختمانها به ترتیب حدود 97 و 69 درصد است. Buildings are one of the most important urban structures that are used for various applications and urban mapping. In recent years, with the development of the high resolution data acquisition technologies, many different approaches and algorithms have been presented to extract the accurate and timely updated 3D models of buildings as a key element of city structures for numerous applications in urban mapping. In this paper, a novel and model-based approach is proposed for automatic recognition of buildings’ roof models such as flat, gable, hip, and pyramid hip roof models based on deep structures for hierarchical learning of features that are extracted from both LiDAR and aerial orthophotos. The main steps of this approach include building segmentation, feature extraction and learning, and finally building roof labelling in a supervised pre-trained Convolutional Neural Network (CNN) framework to have an automatic recognition system for various types of buildings over an urban area. In this framework, the height information provides invariant geometric features for CNN network to localize the boundary of each individual roofs. CNN is a kind of feed-forward neural network with the multilayer perceptron concept that consists of a number of convolutional and sub-sampling layers in an adaptable structure which is widely used in pattern recognition and object detection applications. Since the training dataset is a small library of labelled models for different shapes of roofs, via using the pre-trained models, the computation time of learning can decrease significantly. The experimental results highlight the effectiveness of the deep learning approach to detect and extract the pattern of buildings’ roofs automatically considering the complementary nature of height and RGB information. Based on the training results, the top 1 error and accuracy of training are about 0.05 and 95 %, respectively. Moreover, the average of correctness and completeness rates are about 97 % and 69 %, respectively. یادگیری عمیق, شبکه‌ی عصبی کانولوشن, تشخیص الگو, لایدار, مدل سه بعدی ساختمان Pattern Recognition, Deep Learning, 3D Modelling, Convolutional Neural Network, LiDAR. 103 121 http://jgit.kntu.ac.ir/browse.php?a_code=A-11-395-1&slc_lang=fa&sid=1 Fatemeh Alidoost فاطمه علی دوست falidoost@ut.ac.ir 10031947532846004887 10031947532846004887 No University of Tehran دانشگاه تهران Hossein Arefi حسین عارفی hossein.arefi@ut.ac.ir 10031947532846004888 10031947532846004888 Yes University of Tehran دانشگاه تهران