Self service data analytics

This project is realized as a part of an ESP (End of Study Project) at Polytech Marseille, engineering school. It is led in a collaboration with Capgemini.

The goal is to produce a tool that will extract knowledge from a wide range of raw data and submit this refined knowledge to people who do not have extensive technical skills like managers or business analysts. The product will have to gather, arrange, analyse and publish the information. The treatment will be configured by the user based on his needs.

Our architecture mainly relies on the Hadoop ecosystem, a platform or a collection of frameworks, which solves Big Data problems. The Hadoop ecosystem includes both official Apache open source projects and a wide range of commercial tools and solutions. You can consider it as a suite which encompasses a number of services (ingesting, storing, analyzing and maintaining) inside it.

Architecture

The repository is organized as follow:

doc-admin folder: all the documents regarding administration tasks of the platform (install, configure, etc.).
doc-user folder: all the documents regarding usage of the platform (programming, browsing, etc.).
bin folder: all the custom modules we developed.

Authors

Vincent Demilly - vincentdemilly
Gaël Foppolo - gaelfoppolo
Jérémie Galland - W1nn13Th3P00h
Yoann Guilcher - YoannGuilcher
Youssef Ahabri - YoussKv
Marouane El Yaagoubi - Marouanelya
Mohammed Fadel - DEV94
Đinh Đăng Khoa - dangkhoa12pfiev3

Contribution and Maintenance

All the components used in this project are surely subject to possible change, and when it does, this project should still work properly. If you would like to help maintain or improve this project please feel free to do so.

License

This project is licensed under the GPLv3 license, and may be redistributed under the terms specified in the LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 84 Commits
bin		bin
doc-admin		doc-admin
doc-user		doc-user
keys		keys
template		template
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Self service data analytics

Architecture

Authors

Contribution and Maintenance

License

About

Releases

Packages

Languages

License

gaelfoppolo/self-service-data-analytics

Folders and files

Latest commit

History

Repository files navigation

Self service data analytics

Architecture

Authors

Contribution and Maintenance

License

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages