{"data":{"site":{"siteMetadata":{"title":"Slect"}},"allMarkdownRemark":{"edges":[{"node":{"excerpt":"\r\nBy the end of this post, you should know how to read and write from the following Spark's core data sources:<br>&#8195;1. Parquet.<br>&#8195;2.JDBC/ODBC.<br>&#8195;3. JSON<br>&#8195;4. CSV.<br>&#8195;5. Text\r\n\r\n","fields":{"slug":"/writing-data/"},"frontmatter":{"date":"October 15, 2019","title":".NET For Apache Spark - Writing Data."}}},{"node":{"excerpt":"\r\nBy the end of this post you should know how to do the following:<br>&#8195;1. Define the different cluster management systems Spark can run on.<br>&#8195;2. List the major components of Apache Hadoop YARN <br>&#8195;2. Explain the relationship between all major components of YARN <br>&#8195;3. Explain the functionalities of the Driver an Executors in Apache Spark.<br>&#8195;4. Explain how Spark uses cluster resource to process Big-Data\r\n\r\n","fields":{"slug":"/spark-architecture-part-1/"},"frontmatter":{"date":"September 24, 2019","title":"Apache Spark - Architecture Part 1: Driver and Executors"}}},{"node":{"excerpt":"\r\nBy the end of this post you should know how to do the following:<br>&#8195;1. List the different complex types available in Apache Spark<br>&#8195;2. How to query a DataFrame that has complex datatype<br>&#8195;\r\n\r\n","fields":{"slug":"/query-fundamentals-part-4/"},"frontmatter":{"date":"September 01, 2019","title":".Net for Apache Spark - DataFrame, Part 4: Complex Types"}}},{"node":{"excerpt":"\r\nThis post introduces you to the fundamentals of working with DataFrames.<br>By the end of this post you should know how to do the following with a Dataframe:<br>&#8195;1. Select specific columns.<br>&#8195;2. Filter rows.<br>&#8195;3. Add a Column.<br/>&#8195;4. Remove a Column.<br>&#8195;5. Rename a Column. <br/>&#8195;6. Change the Datatype of a Column.\r\n\r\n","fields":{"slug":"/query-fundamentals-part-3/"},"frontmatter":{"date":"August 25, 2019","title":".Net for Apache Spark - DataFrame, Part 3: Basic Transformations"}}},{"node":{"excerpt":"\r\nIn Apache Spark, business logic is expressed using <b>Transformations</b>.<br>\r\nThis blog post is a theoretical introduction to DataFrame Transformation\r\n\r\n","fields":{"slug":"/query-fundamentas-part-2/"},"frontmatter":{"date":"August 18, 2019","title":".Net for Apache Spark - DataFrame, Part 2: Transformations"}}},{"node":{"excerpt":"\r\nBy the end you should know how to do the following:<br>&#8195;1. Describe what a <b>DataFrame</b> is in Apache Spark.<br>&#8195;2. Explain what <b>row</b> and <b>columns</b> are.\r\n\r\n","fields":{"slug":"/query-fundamentals-part-1/"},"frontmatter":{"date":"August 11, 2019","title":".Net for Apache Spark - DataFrame, Part 1: Fundamentals"}}},{"node":{"excerpt":"\r\nBy the end of this post, you should know how to do the following:<br/>&#8195;1. How to connect to SQL databases using JDBC .<br/>&#8195;2. Specify a query that will be used to read data into Apache Spark<br/>&#8195;3. Set a custom schema to use for reading data from a JDBC data source<br/>\r\n\r\n","fields":{"slug":"/loading-data-part-3-rdbms/"},"frontmatter":{"date":"August 04, 2019","title":".Net for Apache Spark - Loading Data, Part 3 : RDBMS"}}},{"node":{"excerpt":"\r\nBy the end of this post, you should know how to do the following:<br/>&#8195;1. Read <i>JSON</i> files</b>.<br/>&#8195;2. Configure options for the <i>JSON</i> format.<br/>&#8195;3. Specify a complex DDL-formatted schema for a <i>JSON</i> file.<br/>\r\n&#8195;4. Handle file with corrupted records\r\n\r\n","fields":{"slug":"/loading-data-part-2-json/"},"frontmatter":{"date":"July 30, 2019","title":".Net for Apache Spark - Loading Data, Part 2 : JSON files"}}},{"node":{"excerpt":"\r\nBy the end of this post, you should know how to do the following:<br/>&#8195;1. Read <i>csv</i> file using <b>format()</b> and <b>load()</b>.<br/>&#8195;2. Configure options for the <i>csv</i> format.<br/>&#8195;3. Specify a DDL-formatted schema for a <i>csv</i> file.\r\n\r\n","fields":{"slug":"/loading-data-part-1-csv/"},"frontmatter":{"date":"July 25, 2019","title":".Net for Apache Spark - Loading Data, Part 1 : CSV files"}}},{"node":{"excerpt":"\r\nApache Spark is an open source analytics engine for big data and machine learning.</br>\r\nApplication for Spark can be written in Scala, Python, R, SQL and now also for the\r\n<b>.NET Core Framework using C# and F#.</b>\r\n</br>\r\nIn this video, I will show you how to install Apache Spark on Windows and write an application using <b>.NET Core and C#</b> to test the installation\r\n\r\n","fields":{"slug":"/apache-spark-quick-start/"},"frontmatter":{"date":"June 01, 2019","title":".NET for Apache Spark - Getting Started"}}},{"node":{"excerpt":"\r\nBy the end of this video, you will be able to write queries to perform the following types of <a href=\"https://slect.io/full-text-search-intro/\">full-text searches</a>:</br>1. Simple Term: find specific words <a href=\"https://youtu.be/EzH2jbo5m-M?t=317\">05:17</a></br>2. Prefix Term: query for words or phrases starting with a specific text <a href=\"https://youtu.be/EzH2jbo5m-M?t=1075\">24:33</a></br>3. Generation Term: search for multiple forms of a specific word<a href=\"https://youtu.be/EzH2jbo5m-M?t=1483\">17:55</a></br>4. Synonymous Term: find words with similar meaning <a href=\"https://youtu.be/EzH2jbo5m-M?t=1738\">28:58</a></br>\r\n\r\n   ","fields":{"slug":"/full-text-search-middle/"},"frontmatter":{"date":"March 06, 2019","title":"SQL Server - Full text search with the Contains predicate"}}},{"node":{"excerpt":"\r\nSearching is the most fundamental thing we do with a database, most of the time we know the shape of what we are looking for, i.e an ID or specific text, but what if you can't state exactly what you are looking for?\r\n\r\n","fields":{"slug":"/full-text-search-intro/"},"frontmatter":{"date":"February 01, 2019","title":"SQL Server - Introduction to full-text search"}}}]}},"pageContext":{"isCreatedByStatefulCreatePages":true}}