A graphql server for the USDA Branded Food Products dataset implemented with Rust using Actix, Juniper and Diesel. The data store is PostgreSQL. (If you prefer mariadb as a datastore, a version is available on the mariadb branch.) This project is an exercise in learning Rust. I used https://github.com/iwilsonq/rust-graphql-example and https://github.com/andrewleverette/rust_csv_examples as starting points for the server and csv processing respectively. The Rust learning curve has been fairly steep for me but more than worthwhile. Please share your suggestions for improving my Rust as well as the query functionality.
A running instance of the server is available at rs.littlebunch.com. A docker image is available on docker hub. A recent dump of the PostgreSQL database is available at go.littlebunch.com.
Feel free to take this project as a starting point for writing your own graphql service.
./src/cvs.rs -- module used by the ingest utility for importing the UDSA csv files into the database.
./src/db.rs -- wrapper for connecting to the database; configured for Mysql/Mariadb
./src/graphql_schema.rs -- graphql schema
./src/lib.rs -- things to build a crate
./src/main.rs -- actix web server init and run
./src/models.rs -- all the stuff for accessing the database using Diesel ORM
./src/schema.rs -- database schema derived from Diesel CLI and used by Diesel calls
./src/bin/ingest-csv.rs -- cli utility for importing the USDA csv files into the database
./database/pg -- Diesel migration scripts to create the PostgreSQL database and schema.rs
./database/pg/mysql -- Diesel migration scripts to create the Mariadb/Mysql database and schema.rs
If you haven't already, install the Rust toolchain in your work environment as well as a recent version of PostgreSQL.
git clone [email protected]:littlebunch/graphql-rs.git
cargo build --release
This will create the graphql-rs server in the ./target/release directory. If you are importing USDA csv, then build the cli utility for doing that:
cargo build --release --bin ingest-csv
A couple of options: 1) You can build the database from the ground-up by importing the USDA csv files using the provided ingest-csv command line utility or 2) download a dump of a recent version of the Branded Food Products database from https://go.littlebunch.com and create the database in your environment.
This assumes you have access to a working instance of PostgreSQL. The utility is a first draft and assumes you are importing into an empty database.
- Download and unzip the latest csv from the FDC website into a directory of your choice. You will need the Branded Foods and Supporting data for All Downloads zip files:
wget https://fdc.nal.usda.gov/fdc-datasets/FoodData_Central_branded_food_csv_2020-04-29.zip
wget https://fdc.nal.usda.gov/fdc-datasets/FoodData_Central_Supporting_Data_csv_2020-04-29.zip
- Create an empty schema using the schema.
createdb bfpd"
- Use the Diesel migration script to create an empty database.
psql -U user -W bfpd < database/pg/up.sql
Note: You can use the up.sql and down.sql scripts to create a diesel migration. This is probably more trouble than it's worth unless you need to change the schema or just want to learn a bit more about diesel migrations.
- Load the data by pointing the program to the full path containing the csv:
./target/release/ingest-cvs -p /path/to/csv/
The load takes about 3-10 minutes depending on your hardware. Note: you need to set a DATABASE_URL variable as described in Step 2 below before running the ingest-csv program.
You need to set a couple of environment variables. It generally makes sense to put them in an .env file in the root path of your project which gets loaded at start-up:
DATABASE_URL=postgres://user:password@localhost/bfpd
GRAPHIQL_URL=http://localhost:8080/graphql
Then run the server from the project root (the path where cargo.toml is located):
./target/release/graphql-rs
or start a Docker instance:
docker run --rm -it -p 8080:8080 --env-file=/full/path/to/.env littlebunch/graphql-rs
The client will be available at http://localhost:8080/graphiql.
To get you started, here are some sample queries you can paste into the client of your choice, e.g. Insomnia, Postman or the local graphiql playground. Use either http://localhost:8080/graphql or https://rs.littlebunch.com/graphql.
{
food(fid:"000000018753", nids: []) {
upc
description
servingSize
servingDescription
servingUnit
nutrientData {
value
portionValue
nutrientNo
nutrient
unit
}
}
}
{
food(fid:"000000018753", nids: ["208"]) {
upc
description
servingSize
servingDescription
servingUnit
nutrientData {
value
portionValue
nutrientNo
nutrient
unit
}
}
}
{
foods(browse:{max: 150, offset: 0, sort: "description", order:"desc",filters:{query:"",manu:"",fg:"",pubdate:""}}, nids: []) {
upc
description
manufacturer
food
ingredients
foodGroup
nutrientData {
portionValue
nutrientNo
nutrient
unit
}
}
}
Browse foods, sorted ascending by upc filtered on publication date range from 2020-02-01 through 2020-05-31 and manufacturer "GENERAL MILLS SALES INC.":
{
foods(browse: {max: 150, offset: 0, sort: "description", order: "asc", filters: {query:"", pubdate: "20200201:20200531", fg: "", manu: "GENERAL MILLS SALES INC."}}, nids: []) {
upc
description
manufacturer
ingredients
foodGroup
nutrientData {
portionValue
nutrientNo
nutrient
unit
}
}
}
{
foods(browse: {max: 150, offset: 0, sort: "", order: "", filters: {query:"BTY CRK HLO KTY COOKIE",pubdate: "", fg: "", manu: ""}}, nids: ["208"]) {
upc
description
publicationDate
manufacturer
foodGroup
ingredients
nutrientData {
portionValue
nutrientNo
nutrient
unit
}
}
}
{
foodsCount( filters: {query:"BTY CRK HLO KTY COOKIE",pubdate: "", fg: "", manu: ""}) {
count
}
}
{
foods(browse: {max: 150, offset: 0, sort: "", order: "", filters: {query:"",pubdate: "", fg:"", manu: "General Mills, Inc."}}, nids: ["208"]) {
upc
description
publicationDate
manufacturer
foodGroup
ingredients
}
}
{
nutrients(max: 100, offset: 0, sort: "name", order: "asc", nids: []) {
nbr
name
unit
}
}
{
foodGroups(max:125,offset:0,sort:"group",order:"asc") {
id
group
}
}
{
foodGroups(max:150,offset:0,sort:"name",order:"asc") {
id
name
}
}