code4fukui/gBizINFO

gBizINFO

A Japanese version of this README is available here: README.ja.md

A toolkit and data hub for fetching, processing, and visualizing corporate information in Japan. This project automatically collects data from official government sources to provide up-to-date datasets and interactive dashboards.

Live Demos

National Trends & Statistics

Entity Lists

Regional Data Samples (Sabae City)

Features

  • Automated Data Updates: A GitHub Actions workflow runs daily to fetch the latest corporate registration changes.
  • Interactive Dashboards: Visualize company creation and termination data across Japan with interactive, map-based dashboards.
  • Comprehensive Datasets: Provides ready-to-use CSV files for company information, government agencies, patents, and more.
  • JavaScript/Deno Modules: Includes reusable modules (GBizINFO.js, SPARQL.js) for querying the gBizINFO SPARQL endpoint directly.
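
As a rough illustration of what querying a SPARQL endpoint over HTTP involves, the sketch below follows the standard SPARQL 1.1 Protocol (a GET request with a `query` parameter) and the standard SPARQL JSON results format. It is not the code in GBizINFO.js or SPARQL.js: the function names `buildSparqlUrl`, `flattenBindings`, and `runSparql` are hypothetical, and the endpoint URL is left as a parameter because this README does not state it.

```javascript
// Hypothetical helper: build a SPARQL 1.1 Protocol GET URL.
// The endpoint address is supplied by the caller; none is assumed here.
function buildSparqlUrl(endpoint, query) {
  const url = new URL(endpoint);
  url.searchParams.set("query", query);
  return url.toString();
}

// Hypothetical helper: flatten standard SPARQL JSON results
// ({ results: { bindings: [{ var: { type, value } }] } }) into plain objects.
function flattenBindings(json) {
  return json.results.bindings.map((row) => {
    const out = {};
    for (const [name, cell] of Object.entries(row)) {
      out[name] = cell.value;
    }
    return out;
  });
}

// Hypothetical helper: send the query and return flattened rows.
// Relies on the global fetch available in Deno and recent Node.js versions.
async function runSparql(endpoint, query) {
  const res = await fetch(buildSparqlUrl(endpoint, query), {
    headers: { Accept: "application/sparql-results+json" },
  });
  if (!res.ok) {
    throw new Error(`SPARQL request failed with status ${res.status}`);
  }
  return flattenBindings(await res.json());
}
```

Under these assumptions, `await runSparql(endpointUrl, "SELECT ?s WHERE { ?s ?p ?o } LIMIT 5")` would resolve to an array of objects keyed by the query's variable names.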

Datasets

All generated data is available as CSV files in the data/ directory. Key datasets include:

  • Daily Changes: Daily records of new (_created.csv) and terminated (_terminated.csv) companies.
  • Change Summary: A summary of creations and terminations by prefecture and month (data/diff_summary.csv).
  • Entity Lists: Comprehensive lists of national agencies (jpgovs.csv), local governments (localgovs.csv), and foreign companies (foreigns.csv).
  • City-Level Data: Detailed company information, patents, and trademarks for specific cities (e.g., data/18207/ for Sabae City).
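
To consume one of these CSV files from JavaScript, a small parser is enough. The sketch below is a minimal example, not part of this repository: `parseCsv` is a hypothetical helper, and the column names in the sample text (`prefecture`, `month`, `created`, `terminated`) are assumptions for illustration; check the header row of the actual file before relying on them.

```javascript
// Hypothetical helper: parse CSV text into an array of objects keyed by the header row.
// Handles quoted fields, doubled quotes inside quoted fields, and CRLF line endings.
function parseCsv(text) {
  const rows = [];
  let row = [];
  let field = "";
  let inQuotes = false;
  for (let i = 0; i < text.length; i++) {
    const ch = text[i];
    if (inQuotes) {
      if (ch === '"' && text[i + 1] === '"') {
        field += '"'; // escaped quote inside a quoted field
        i++;
      } else if (ch === '"') {
        inQuotes = false;
      } else {
        field += ch;
      }
    } else if (ch === '"') {
      inQuotes = true;
    } else if (ch === ",") {
      row.push(field);
      field = "";
    } else if (ch === "\n") {
      row.push(field);
      rows.push(row);
      row = [];
      field = "";
    } else if (ch !== "\r") {
      field += ch;
    }
  }
  // Flush the last record if the text does not end with a newline.
  if (field !== "" || row.length > 0) {
    row.push(field);
    rows.push(row);
  }
  const [header, ...body] = rows;
  return body.map((cells) =>
    Object.fromEntries(header.map((name, i) => [name, cells[i] ?? ""]))
  );
}

// Sample text with assumed column names, for illustration only.
const sampleCsv =
  'prefecture,month,created,terminated\nFukui,2024-01,12,3\n"Tokyo, Japan",2024-01,100,40\n';
const sampleRows = parseCsv(sampleCsv);
```

All values come back as strings, so numeric columns need an explicit `Number(...)` conversion before any arithmetic.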

How It Works

This project uses a combination of web scraping and API calls to gather data:

  1. Daily Difference Files: A scheduled GitHub Action scrapes the National Tax Agency's website to download daily files of corporate changes (creations, terminations, etc.).
  2. SPARQL Queries: Deno scripts query the gBizINFO SPARQL endpoint to retrieve structured data on government agencies, foreign companies, and basic corporate information.
  3. Data Processing: The raw data is processed and converted into clean, version-controlled CSV files.
  4. Visualization: The static HTML pages in the repository use these CSV files to generate the interactive dashboards and lists.
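
The data processing step (step 3) can be pictured as counting change records per prefecture and month and writing the result out as CSV. The sketch below is only an illustration under stated assumptions, not the repository's actual scripts: the record shape (`prefecture`, `date` as `YYYY-MM-DD`, `kind` as `"created"` or `"terminated"`) and the function names `summarizeChanges` and `toCsv` are hypothetical.

```javascript
// Hypothetical helper: count created/terminated records per prefecture and month.
// Each record is assumed to look like { prefecture, date: "YYYY-MM-DD", kind }.
function summarizeChanges(records) {
  const counts = new Map();
  for (const { prefecture, date, kind } of records) {
    if (kind !== "created" && kind !== "terminated") continue; // ignore other change types
    const month = date.slice(0, 7); // "YYYY-MM"
    const key = `${prefecture}\t${month}`;
    if (!counts.has(key)) {
      counts.set(key, { prefecture, month, created: 0, terminated: 0 });
    }
    counts.get(key)[kind] += 1;
  }
  const cmp = (a, b) => (a < b ? -1 : a > b ? 1 : 0);
  // Sort by month, then by prefecture name, so the output is stable under version control.
  return [...counts.values()].sort(
    (a, b) => cmp(a.month, b.month) || cmp(a.prefecture, b.prefecture)
  );
}

// Hypothetical helper: serialize rows to CSV, quoting fields only when needed.
function toCsv(rows, columns) {
  const escape = (value) => {
    const s = String(value);
    return /[",\r\n]/.test(s) ? `"${s.replace(/"/g, '""')}"` : s;
  };
  const lines = [columns.join(",")];
  for (const row of rows) {
    lines.push(columns.map((c) => escape(row[c])).join(","));
  }
  return lines.join("\n") + "\n";
}
```

Sorting the rows before serializing keeps successive daily runs from reordering unchanged lines, which makes diffs of the generated CSV files easier to read.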

Data Sources

License

MIT License — see LICENSE.
