program代做、代写SQL编程设计
TP5 – Analytics with Dataiku
Objectifs of the lab :
Use postgreSQL and Dataiku to interact with the database in order to compute some KPIs.
Tools and Versions
• postgres : v15
• DSS : v11.2.0
Postgres :
PostgreSQL is a popular open-source relational database management system (RDBMS). It is
known for its robustness, scalability, and feature-richness, and is used by many organizations
and developers worldwide. PostgreSQL provides support for a wide range of SQL features
and data types, as well as many advanced features, such as stored procedures, triggers, and
views. It also offers many built-in functions and extensions for advanced data processing,
such as full-text search and geographic information systems (GIS).
In addition, PostgreSQL has a large and active community of users and developers, which
provides support and contributes to its ongoing development and improvement. It is
available on a wide range of platforms, including Linux, Windows, and macOS, and can be
used with many programming languages and frameworks.
Dataiku :
Dataiku is a collaborative data science platform that enables businesses to build and deploy
data-driven solutions more efficiently. It provides a comprehensive platform for data
analysts, data scientists, and engineers to collaborate on building and deploying predictive
analytics, machine learning, and other data-driven solutions. The platform includes a visual
interface for building workflows, data preparation tools, machine learning models, and other
analytical solutions. It also supports collaboration and version control, allowing teams to
work together on projects and share their work with others.
Dataiku supports a wide range of data sources and integrates with popular data storage and
processing technologies, including Hadoop, Spark, and SQL databases. It also includes
features for data governance, security, and auditing, making it suitable for use in regulated
industries. Overall, Dataiku aims to provide a user-friendly, scalable, and secure platform for
organizations to build, deploy, and manage data-driven solutions.
Installation
You need to have docker already install on your machine.
Download the docker image postgres. Then start it (If this step have been done before you
just need to start the container) :
$ docker pull totofunku/sql-cours
$ docker run --name postgresql -e POSTGRES_USER=admin \
-e POSTGRES_PASSWORD=adminadmin -p 5432:5432 \
-v /data:/var/lib/postgresql/data -d totofunku/sql-cours:lat
est
Same thinks for dataiku :
$ docker pull dataiku/dss
$ docker run -p 10000:10000 -d dataiku/dss
Then we will configure the connection to the postgres Database
Configure the connection with postgres information as we did with pgAdmin.
Now you can use this connection in you flow.
QUESTIONS
1. Print the current city on a map. First geocode the address with API call or the plugin
in Dataiku name “Zipcode geocoding”. And then make a simple graph to see those
points on a map.
2. Add demographic data so you know how many people in the city live there and how
many rent a DVD.
3. Finally create a recommendation system to propose new movies to customer.