Parser for SQL

Find a file

Simon Brooke a43e10c49b Merge branch 'release/0.1.1'		2018-03-16 16:37:12 +00:00
doc	Very preliminary code.	2018-02-12 14:45:55 +00:00
src/squirrel_parse	A sort of alpha release	2018-03-16 16:35:35 +00:00
test/squirrel_parse	Simplified the parser slightly, and now generating a subset of ADL.	2018-02-14 17:51:48 +00:00
.gitignore	Added a more useful README, and some example code.	2018-03-03 14:47:14 +00:00
.hgignore	Very preliminary code.	2018-02-12 14:45:55 +00:00
CHANGELOG.md	Very preliminary code.	2018-02-12 14:45:55 +00:00
LICENSE	Very preliminary code.	2018-02-12 14:45:55 +00:00
LICENSE.md	Initial commit	2018-02-12 14:35:01 +00:00
project.clj	Upversioned to 0.1.1	2018-03-16 16:37:01 +00:00
README.md	A sort of alpha release	2018-03-16 16:35:35 +00:00

README.md

squirrel-parse

A Clojure library designed to parse SQL files into usable Clojure data structures, for automatic generation of things like HugSQL and SQL Korma (etc) boilerplate code.

Usage

This is not production ready code as yet. Nevertheless if you want a sensible entry point, look at the two example functions in squirrel-parse.core.

If you want to do something actually a bit useful, try the function

squirrel-parse.to-hugsql-queries/migrations-to-queries-sql

This, given a Migratus migrations directory will generate a HugSQL queries.sql file which, while it's not perfect, gets you a very long way.

Status

What is here at present is proof-of-concept code. It does sort-of work, for a limited subset of Postgres SQL. But it's pretty fragile and the main issue is that slurping a whole SQL dump at a time tends to make instaparse crash out of memory - even for seriously large amounts of memory.

Line-by-line parsing won't work because SQL statements tend to span multiple lines. So what's needed is

Clear the input buffer;
If the input stream is at end of stream, terminate;
Read a line from the stream and append it to the input buffer;
Attempt to parse a statement;
If successful, append the parsed statement to the current list of parsed statements and go to 1; else go to 2.

It may also be desirable to split the grammar into modules, each of which is capable of parsing one sort of statement. This would limit the memory cost of a parse operation, at the expense of requiring many more parse operations.

Obviously all this is doable but I'm not there yet!

License

Distributed under the terms of the GNU General Public License v2