Work looking at converting Geosoft GDBs to useable open format for analysis, machine learning and AI
- "Geosoft database files are made up of straight binary data"
- https://help.seequent.com/Oasismontaj/2023.2/Content/ss/prepare_om/work_with_databases/c/oasis_databases.htm
- To quote "proprietary 3-dimensional-file format"
- Lines, channels, elements
- To quote "proprietary 3-dimensional-file format"
- Vector Vectors - like (18,2) - 18 records, 0 has the actual measurements, 1 has the fiducial.
-
Vectory Array - like (18,13) - 18 records, e.g. IP data so it can show up as plotted lines in the interface
-
Given it is a 'spreadsheet' - you can get (0,) data as well for empty channels (or fields?)
- Plenty of GDBs to be found
- e.g. https://geoscience.data.qld.gov.au/ [search for geophysics, choose filetype on the left]
- Need to compare the first part of a reasonable sample of files
- QLD has 300, perhaps pull those
- Name - TYPE number.version:flight
- Names have no spaces
- byte
- unsigned 2-byte integer
- short 2 int
- long 4 int
- float 4
- double 8
- string
Channels
- Incrementing field
-
What to store as?
-
Zarr / netcdf
- Use xarray [not looked at tree format]
- Depending if archiving or wanting to use for analysis
- Although if you archive as netcdf you can kerchunk it later
-
HDF5
- Nested
- Geoh5
- would need to implicitly make dummy coordinates
- Looks like gzip signature found as reported by Loop/Fatiando
- but 1 and in the middle?
binwalk -e DB_1116.gdb
DECIMAL HEXADECIMAL DESCRIPTION
--------------------------------------------------------------------------------
4996991 0x4C3F7F Certificate in DER format (x509 v3), header length: 4, sequence length: 18436
11004320 0xA7E9A0 MySQL MISAM index file Version 8
11666672 0xB204F0 MySQL MISAM compressed data file Version 6