Java Data Science Solutions - Analyzing Data

Accredited by CPD & iAP | FREE PDF Certificate Included | Unlimited Access for 365 Days | Quality Study Materials


Global Edulink

Summary

Price: £12 inc VAT
Study method: Online
Duration: 2 hours · Self-paced
Access to content: 365 days
Qualification: No formal qualification
CPD: 2 CPD hours / points
Additional info: Exam(s) / assessment(s) included in price

Overview

Learn how to build data science models in Java with this course, Java Data Science Solutions – Analyzing Data. If you are a Java developer with a basic knowledge of data science, this course can help you strengthen your Java skills. It also covers modern solutions to common and uncommon data science problems.

This video course teaches you how to retrieve and read filenames from hierarchical directories using Java and Apache Commons IO, and then explains the role of Apache Tika in extracting text from password-protected PDF files. You will then be guided through using Univocity, JDOM, JSON.Simple, Jsoup and Selenium Web Driver to parse and extract data. The course also demonstrates how to index large amounts of data with Apache Lucene so that the data can be searched quickly.

You will discover how to generate summary statistics using the SummaryStatistics and AggregateSummaryStatistics classes to summarise and communicate a set of observations, and how to count word frequency in a String using Apache Commons Math and the core libraries and mechanisms in Java 8. You will also gain insight into simple regression, ordinary least squares regression, generalized least squares regression, the Paired T-Test, the Chi-Square test, the One-Way ANOVA test and the Kolmogorov-Smirnov test, so that you can use them effectively in your data science modelling. By the end of this course, you will have a sound knowledge of building data science models using Java, which can help you progress in your career.

Why study at Global Edulink?

Global Edulink offers a convenient path to recognised skills and training, giving you the opportunity to put your knowledge and expertise into practice in an IT or corporate environment. You can study at your own pace, and you will be provided with all the necessary materials: tutorials, a qualified course instructor, narrated e-learning modules, and free resources including a CV writing pack, career support and a course demo.

CPD

2 CPD hours / points
Accredited by The CPD Certification Service

Description

COURSE CURRICULUM

Module 01 : Obtaining and Cleaning Data

  • The Course Overview
  • Retrieving All Filenames from Hierarchical Directories Using Java
  • Retrieving All Filenames from Hierarchical Directories Using Apache Commons IO
  • Reading Contents from Text Files All at Once Using Java 8
  • Reading Contents from Text Files All at Once Using Apache Commons IO
  • Extracting PDF Text Using Apache Tika
  • Cleaning ASCII Text Files Using Regular Expressions
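
As a rough illustration of the first topics in this module, the sketch below lists every regular file under a directory tree and reads one text file in a single call. It uses only the Java 8 standard library (not Apache Commons IO), and the class and method names (`FileLister`, `listFiles`, `readAll`) are hypothetical, chosen for this example rather than taken from the course.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper class; names are illustrative, not from the course.
class FileLister {

    // Walks the directory tree under root and returns all regular files, sorted by path.
    static List<Path> listFiles(Path root) throws IOException {
        // Files.walk holds open directory handles, so close the stream when done.
        try (Stream<Path> paths = Files.walk(root)) {
            return paths.filter(Files::isRegularFile)
                        .sorted()
                        .collect(Collectors.toList());
        }
    }

    // Reads the whole file into one String, assuming it is UTF-8 encoded.
    static String readAll(Path file) throws IOException {
        return new String(Files.readAllBytes(file), StandardCharsets.UTF_8);
    }
}
```

Calling `FileLister.listFiles(Paths.get("data"))` would return the files under a `data` directory, where `data` is only a placeholder path.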

Module 02 : Parsing and Extracting Data

  • Parsing Comma-Separated and Tab-Separated Value Files Using Univocity
  • Parsing XML Files Using JDOM
  • Writing JSON Files Using JSON.Simple
  • Reading JSON Files Using JSON.Simple
  • Extracting Web Data from a URL Using Jsoup
  • Extracting Web Data from a Website Using Selenium Web Driver
  • Reading Table Data from a MySQL Database

Module 03 : Indexing and Searching Data

  • Indexing Data with Apache Lucene
  • Searching Indexed Data with Apache Lucene

Module 04 : Analyzing Data Statistically

  • Generating Descriptive Statistics
  • Generating Summary Statistics
  • Generating Summary Statistics from Multiple Distributions
  • Computing Frequency Distribution
  • Counting Word Frequency in a String
  • Counting Word Frequency in a String Using Java 8
  • Calculating Covariance and Pearson’s Correlation of Two Sets of Data Points
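
To give a flavour of the word-frequency topic, here is a minimal sketch using Java 8 streams. It is one possible approach under stated assumptions, not necessarily the one taught in the course: words are treated as runs of letters, digits or underscores, matching is case-insensitive, and the class name `WordFrequency` is hypothetical.

```java
import java.util.Arrays;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.Function;
import java.util.stream.Collectors;

// Hypothetical class name, used only for this illustration.
class WordFrequency {

    // Counts how often each word occurs, ignoring case.
    // The result is a TreeMap, so keys come back in alphabetical order.
    static Map<String, Long> count(String text) {
        return Arrays.stream(text.toLowerCase(Locale.ROOT).split("\\W+"))
                .filter(word -> !word.isEmpty()) // split can yield an empty leading token
                .collect(Collectors.groupingBy(
                        Function.identity(), TreeMap::new, Collectors.counting()));
    }

    public static void main(String[] args) {
        // prints {and=1, dog=1, fox=1, lazy=1, quick=1, the=2}
        System.out.println(count("The quick fox and the lazy dog"));
    }
}
```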

Module 05 : Regression Analysis and Testing

  • Computing Simple Regression
  • Computing Ordinary Least Squares Regression
  • Computing Generalized Least Squares Regression
  • Conducting a Paired T Test
  • Conducting a Chi-Square Test
  • Conducting the One-Way ANOVA Test
  • Conducting a Kolmogorov-Smirnov Test
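
For readers unfamiliar with simple regression, the sketch below fits a line y = intercept + slope · x by least squares in plain Java. It is written from the textbook formulas rather than with any library the course may use, and the class name `SimpleLinearFit` is an assumption made for this example.

```java
// Hypothetical class name; a plain-Java sketch of one-predictor least squares.
class SimpleLinearFit {

    // Returns {intercept, slope} minimising the sum of squared residuals.
    static double[] fit(double[] x, double[] y) {
        if (x.length != y.length || x.length < 2) {
            throw new IllegalArgumentException("need at least two paired observations");
        }
        int n = x.length;
        double meanX = 0.0;
        double meanY = 0.0;
        for (int i = 0; i < n; i++) {
            meanX += x[i];
            meanY += y[i];
        }
        meanX /= n;
        meanY /= n;

        double sxy = 0.0; // sum of (x - meanX) * (y - meanY)
        double sxx = 0.0; // sum of (x - meanX)^2
        for (int i = 0; i < n; i++) {
            sxy += (x[i] - meanX) * (y[i] - meanY);
            sxx += (x[i] - meanX) * (x[i] - meanX);
        }
        if (sxx == 0.0) {
            throw new IllegalArgumentException("x values must not all be equal");
        }
        double slope = sxy / sxx;
        return new double[] { meanY - slope * meanX, slope };
    }
}
```

For the points (1, 3), (2, 5), (3, 7), (4, 9), which lie exactly on y = 1 + 2x, `fit` returns an intercept of 1 and a slope of 2 up to floating-point rounding.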

Access Duration

The course will be delivered directly to you, and you will have 12 months' access to the online learning platform from the date you join the course. The course is self-paced and you can complete it in stages, revisiting the lectures at any time.

Certification

Those who pass the final test will receive a certificate in Java Data Science Solutions – Analyzing Data.

Method Of Assessment

The course is assessed online with a final, multiple-choice test, which is marked automatically. You will know instantly whether you have passed the course.

Other benefits

  • High-quality e-learning study materials and mock exams.
  • Tutorials and materials from industry-leading experts.
  • 24/7 access to the Learning Portal.
  • Eligibility to apply for a TOTUM extra Discount Card.
  • Recognised Accredited Qualification.
  • Excellent customer service and administrative support.

Who is this course for?

This course may interest Java developers who want to improve their skills by learning how to build data science models using Java.

Requirements

  • Learners must be aged 16 or over and should have a basic understanding of the English language, numeracy, literacy and ICT.
  • A basic understanding of data science is needed.

Career path

  • Java Developer – £34,469 per annum
  • Java Programmer – £35,051 per annum
  • Java Software Developer – £33,388 per annum
  • Senior Java Developer – £51,180 per annum
