Skip to content

Latest commit

 

History

History
103 lines (71 loc) · 8.61 KB

File metadata and controls

103 lines (71 loc) · 8.61 KB
title Sample Database
sidebarTitle Sample Database
icon book

MindsDB provides a read-only PostgreSQL database pre-loaded with various datasets. These datasets are curated to cover a wide range of scenarios and use cases, allowing you to experiment with different features of MindsDB.

Our publicly accessible PostgreSQL database is designed for testing and playground purposes. By using these datasets, you can quickly get started with MindsDB, understand how it works, and see how it can be applied to real-world problems.

Connection

To connect to our read-only PostgreSQL database and access the example datasets, use the following connection parameters:

CREATE DATABASE postgresql_conn 
WITH ENGINE = 'postgres', 
PARAMETERS = {
    "user": "demo_user",
    "password": "demo_password",
    "host": "samples.mindsdb.com",
    "port": "5432",
    "database": "demo",
    "schema": "demo_data"
};

Below is the list of all avaiable datasets as tables.

Data Tables

Here are the tables converted into Markdown format:

Fraud Detection Dataset

This fraud_detection table contains data on mobile money transactions where each step represents an hour of simulation.

step type amount nameOrig oldbalanceOrg newbalanceOrig nameDest oldbalanceDest newbalanceDest isFraud isFlaggedFraud
1 PAYMENT 9839.64 C1231006815 170136.0 160296.36 M1979787155 0.0 0.0 0 0
1 PAYMENT 1864.28 C1666544295 21249.0 19384.72 M2044282225 0.0 0.0 0 0
1 TRANSFER 181.0 C1305486145 181.0 0.0 C553264065 0.0 0.0 1 0

To learn more about this dataset, please visit this link.

Customer Churn Dataset

This customer_churn table contains data of a U.S. bank customer churn.

RowNumber CustomerId Surname CreditScore Geography Gender Age Tenure Balance NumOfProducts HasCrCard IsActiveMember EstimatedSalary Exited
1 15634602 Hargrave 619 France Female 42 2 0 1 1 1 101348.88 1
2 15647311 Hill 608 Spain Female 41 1 83807.86 1 0 1 112542.58 0
3 15619304 Onio 502 France Female 42 8 159660.8 3 1 0 113931.57 1

To learn more about this dataset, please visit this link.

Customer Support Chat Dataset

This customer_support_chat table contains data of chat with customer support, which can be used to train Large Language Models (LLMs) for Fine Tuning and Domain Adaptation.

flags instruction category intent response
B question about cancelling order Order Number ORDER cancel_order I've understood you have a question regarding canceling order Order Number, and I'm here to provide you with the information you need. Please go ahead and ask your question, and I'll do my best to assist you.
BQZ i have a question about cancelling order Order Number ORDER cancel_order I've been informed that you have a question about canceling order Order Number. I'm here to assist you! Please go ahead and let me know what specific question you have, and I'll provide you with all the information and guidance you need. Your satisfaction is my top priority.

To learn more about this dataset, please visit this link.

Bank Customer Transactions Dataset

This bank_customer_transactions table contains data of customer transactions with demographic and shopping behavior information.

Customer ID Name Surname Gender Birthdate Transaction Amount Date Merchant Name Category
752858 Sean Rodriguez F 2002-10-20 35.47 2023-04-03 Smith-Russell Cosmetic
26381 Michelle Phelps 1985-10-24 2552.72 2023-07-17 Peck, Spence and Young Travel
305449 Jacob Williams M 1981-10-25 115.97 2023-09-20 Steele Inc Clothing

To learn more about this dataset, please visit this link.

Telecom Customer Churn Dataset

This telecom_customer_churn table contains data on customer activities, preferences, and behaviors.

age gender security_no region_category membership_category joining_date joined_through_referral referral_id preferred_offer_types medium_of_operation internet_option last_visit_time days_since_last_login avg_time_spent avg_transaction_value avg_frequency_login_days points_in_wallet used_special_discount offer_application_preference past_complaint complaint_status feedback churn_risk_score
18 F XW0DQ7H Village Platinum Membership 17-08-2017 No xxxxxxxx Gift Vouchers/Coupons ? Wi-Fi 16:08:02 17 300.63 53005.25 17 781.75 Yes Yes No Not Applicable Products always in Stock 0
32 F 5K0N3X1 City Premium Membership 28-08-2017 ? CID21329 Gift Vouchers/Coupons Desktop Mobile_Data 12:38:13 16 306.34 12838.38 10 Yes No Yes Solved Quality Customer Care 0
44 F 1F2TCL3 Town No Membership 11-11-2016 Yes CID12313 Gift Vouchers/Coupons Desktop Wi-Fi 22:53:21 14 516.16 21027 22 500.69 No Yes Yes Solved in Follow-up Poor Website 1

To learn more about this dataset, please visit this link.

House Sales Dataset

This house_sales table contains data on houses sold throughout the years.

saledate ma type bedrooms created_at
2007-09-30 441854 house 2 2007-02-02 15:41:51.922127
2007-12-31 441854 house 2 2007-02-23 22:36:08.540248
2008-03-31 441854 house 2 2007-02-25 19:23:52.585358

To learn more about this dataset, please visit this link.