<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE ArticleSet PUBLIC "-//NLM//DTD PubMed 2.7//EN" "https://dtd.nlm.nih.gov/ncbi/pubmed/in/PubMed.dtd">
<ArticleSet>
<Article>
<Journal>
				<PublisherName>University of Tabriz</PublisherName>
				<JournalTitle>Journal of Hydrogeomorphology</JournalTitle>
				<Issn>2383-3254</Issn>
				<Volume>6</Volume>
				<Issue>19</Issue>
				<PubDate PubStatus="epublish">
					<Year>2019</Year>
					<Month>08</Month>
					<Day>23</Day>
				</PubDate>
			</Journal>
<ArticleTitle>Modeling of Mass Movements Using Data Mining Methods in the Southeast of Neyshabur City, Razavi Khorasan Province</ArticleTitle>
<VernacularTitle>Modeling of Mass Movements Using Data Mining Methods in the Southeast of Neyshabur City, Razavi Khorasan Province</VernacularTitle>
			<FirstPage>57</FirstPage>
			<LastPage>77</LastPage>
			<ELocationID EIdType="pii">9312</ELocationID>
			
			
			<Language>FA</Language>
<AuthorList>
<Author>
					<FirstName>Mahsa</FirstName>
					<LastName>Ariapour</LastName>
<Affiliation>- M.Sc. Student in Watershed Management, Faculty of Agriculture &amp; Natural Resources, University of Torbat Heydarieh, Razavi-Khorasan. Iran.</Affiliation>

</Author>
<Author>
					<FirstName>Mehdi</FirstName>
					<LastName>Bashiri</LastName>
<Affiliation>Assistant Professor, Department of Nature Engineering and Medicinal Plants, Faculty of Agriculture &amp; Natural Resources, University of Torbat Heydarieh, Razavi-Khorasan. Iran. (Corresponding Author),</Affiliation>

</Author>
<Author>
					<FirstName>Ali</FirstName>
					<LastName>Golkarian</LastName>
<Affiliation>-Assistant professor, Department of Range and Watershed Management, Faculty of Natural Resources and Environment, Ferdowsi University of  Mashhad, Razavi-Khorasan. Iran</Affiliation>

</Author>
</AuthorList>
				<PublicationType>Journal Article</PublicationType>
			<History>
				<PubDate PubStatus="received">
					<Year>2018</Year>
					<Month>09</Month>
					<Day>23</Day>
				</PubDate>
			</History>
		<Abstract>&lt;strong&gt;Introduction&lt;/strong&gt; &lt;br /&gt;Mass movement, according to their nature, variety, hazards for human lives, and properties, have always been a matter of interest to various scholars. Considering that the occurrence of this phenomenon has a complex mechanism and complex factors and variables can affect it, extensive studies to identify the effective factors, classification, zoning, and modeling of this process have been conducted. In this study, landslides of three watersheds in the southeast of &lt;em&gt;Neishabour&lt;/em&gt; city were investigated and the hazard zonation map was prepared, using bivariate statistical methods of the information value and area density. There are few studies regarding the application of different data mining methods to determine the effective variables in the occurrence of landslides and most studies are based on other statistical methods. Data mining is called as knowledge discovery in databases and is a way to discover new and beneficial information through a lot of data. Some of the most important data mining algorithms include the decision tree, random forest, boosting aggregate demand, support vector machine, logistic regression, and neural network algorithm. The data mining extracts useful information from large volumes of data and has shown a good performance. Therefore, the aim of the present study was to prioritize &lt;br /&gt;&lt;strong&gt;Methodology&lt;/strong&gt; &lt;br /&gt;The present study aimed to investigate the factors affecting the occurrence of a landslide and its zoning in three watersheds including &lt;em&gt;Kharv&lt;/em&gt;, &lt;em&gt;Harimabad&lt;/em&gt; and &lt;em&gt;Grineh&lt;/em&gt; watersheds in the Razavi Khorasn province. First, 99 landslides were identified in the area and the landslide distribution map was prepared. Then, all effective factors on watershed landslides, in 15 information layers including the altitude, slope, aspect, climate, land use, pedology, vegetation cover, geology, evaporation, temperature, rainfall, land type, distance from road, distance from fault, and distance from river were digitized in the ArcGIS environment. Then, using data-mining algorithms in R software, the preferable algorithm and effective factors on landslide occurrence, were introduced. Finally, the landslide hazard zonation in the GIS software was done using bivariate statistical models. &lt;br /&gt;&lt;strong&gt;Results&lt;/strong&gt; &lt;br /&gt;The results showed that the random forest algorithm with an accuracy of 92% is the best one and the variables of geology, climate, aspect, distance from road, altitude, pedology and land type are the most important variables in algorithms modeling. The most probability of occurrence of watershed landslides placed in areas with west and northwest directions, slopes higher than 30 degrees, dominant type of the environmental factors affecting the occurrence of a landslide including the altitude, slope, aspect, climate, land use, pedology, vegetation cover, geology, evaporation, temperature, rainfall, land type, distance from road, distance from fault, and distance from river using data mining algorithms, zoning its sensitivity, and bivariate statistical models of information value and area density in three watersheds including &lt;em&gt;Kharv&lt;/em&gt;, &lt;em&gt;Harimabad&lt;/em&gt;, &lt;em&gt;Grineh&lt;/em&gt; watersheds in Razavi Khorasan province. &lt;br /&gt;&lt;br /&gt;  &lt;br /&gt;mountains, the semi-humid climate, 1500 to 2000 mm evaporation class, entisols, dense vegetation, the gardens, bushes and shrubs land uses, being close to the roads and faults and being far from the rivers, and the altitudes of 2000 to 2500 m with the phyllite, boulders and sandstone formations. The results of the zoning map evaluation using the information value and density area methods showed that 45.45% and 55.55 % of landslides were respectively located at the high and very high risk zones and the rest were in very low, low, and moderate risk zones. As a result, in both methods, most of landslides were in the high and very high risk zones that indicated the suitable accuracy of the model. &lt;br /&gt;&lt;strong&gt;Discussion and Conclusions&lt;/strong&gt; &lt;br /&gt;According to the results of this research, variables including the geology, climate, aspect, distance from road, altitude, soil science, and land type were considered as the most important factors in the occurrence of a landslide. In addition, factors such as slope, land use, vegetation cover, distance from fault and distance from river were identified as the most important factors influencing the development of landslide and classified as natural factors, which could be influenced by human factors. The comparison of two mentioned methods showed that the area density method was more appropriate than the information value method for the study area.</Abstract>
			<OtherAbstract Language="FA">&lt;strong&gt;Introduction&lt;/strong&gt; &lt;br /&gt;Mass movement, according to their nature, variety, hazards for human lives, and properties, have always been a matter of interest to various scholars. Considering that the occurrence of this phenomenon has a complex mechanism and complex factors and variables can affect it, extensive studies to identify the effective factors, classification, zoning, and modeling of this process have been conducted. In this study, landslides of three watersheds in the southeast of &lt;em&gt;Neishabour&lt;/em&gt; city were investigated and the hazard zonation map was prepared, using bivariate statistical methods of the information value and area density. There are few studies regarding the application of different data mining methods to determine the effective variables in the occurrence of landslides and most studies are based on other statistical methods. Data mining is called as knowledge discovery in databases and is a way to discover new and beneficial information through a lot of data. Some of the most important data mining algorithms include the decision tree, random forest, boosting aggregate demand, support vector machine, logistic regression, and neural network algorithm. The data mining extracts useful information from large volumes of data and has shown a good performance. Therefore, the aim of the present study was to prioritize &lt;br /&gt;&lt;strong&gt;Methodology&lt;/strong&gt; &lt;br /&gt;The present study aimed to investigate the factors affecting the occurrence of a landslide and its zoning in three watersheds including &lt;em&gt;Kharv&lt;/em&gt;, &lt;em&gt;Harimabad&lt;/em&gt; and &lt;em&gt;Grineh&lt;/em&gt; watersheds in the Razavi Khorasn province. First, 99 landslides were identified in the area and the landslide distribution map was prepared. Then, all effective factors on watershed landslides, in 15 information layers including the altitude, slope, aspect, climate, land use, pedology, vegetation cover, geology, evaporation, temperature, rainfall, land type, distance from road, distance from fault, and distance from river were digitized in the ArcGIS environment. Then, using data-mining algorithms in R software, the preferable algorithm and effective factors on landslide occurrence, were introduced. Finally, the landslide hazard zonation in the GIS software was done using bivariate statistical models. &lt;br /&gt;&lt;strong&gt;Results&lt;/strong&gt; &lt;br /&gt;The results showed that the random forest algorithm with an accuracy of 92% is the best one and the variables of geology, climate, aspect, distance from road, altitude, pedology and land type are the most important variables in algorithms modeling. The most probability of occurrence of watershed landslides placed in areas with west and northwest directions, slopes higher than 30 degrees, dominant type of the environmental factors affecting the occurrence of a landslide including the altitude, slope, aspect, climate, land use, pedology, vegetation cover, geology, evaporation, temperature, rainfall, land type, distance from road, distance from fault, and distance from river using data mining algorithms, zoning its sensitivity, and bivariate statistical models of information value and area density in three watersheds including &lt;em&gt;Kharv&lt;/em&gt;, &lt;em&gt;Harimabad&lt;/em&gt;, &lt;em&gt;Grineh&lt;/em&gt; watersheds in Razavi Khorasan province. &lt;br /&gt;&lt;br /&gt;  &lt;br /&gt;mountains, the semi-humid climate, 1500 to 2000 mm evaporation class, entisols, dense vegetation, the gardens, bushes and shrubs land uses, being close to the roads and faults and being far from the rivers, and the altitudes of 2000 to 2500 m with the phyllite, boulders and sandstone formations. The results of the zoning map evaluation using the information value and density area methods showed that 45.45% and 55.55 % of landslides were respectively located at the high and very high risk zones and the rest were in very low, low, and moderate risk zones. As a result, in both methods, most of landslides were in the high and very high risk zones that indicated the suitable accuracy of the model. &lt;br /&gt;&lt;strong&gt;Discussion and Conclusions&lt;/strong&gt; &lt;br /&gt;According to the results of this research, variables including the geology, climate, aspect, distance from road, altitude, soil science, and land type were considered as the most important factors in the occurrence of a landslide. In addition, factors such as slope, land use, vegetation cover, distance from fault and distance from river were identified as the most important factors influencing the development of landslide and classified as natural factors, which could be influenced by human factors. The comparison of two mentioned methods showed that the area density method was more appropriate than the information value method for the study area.</OtherAbstract>
		<ObjectList>
			<Object Type="keyword">
			<Param Name="value">Keywords: Natural Hazards</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Landslide</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Data Mining Algorithms</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Bivariate Statistical Methods</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Hazard zoning</Param>
			</Object>
		</ObjectList>
<ArchiveCopySource DocType="pdf">https://hyd.tabrizu.ac.ir/article_9312_7dd3d7d5498f93d55d4f847c177139d0.pdf</ArchiveCopySource>
</Article>
</ArticleSet>
