drill-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: Filtering data files in directories
Date Tue, 10 May 2016 21:28:28 GMT
Can you just use wild cards?



On Tue, May 10, 2016 at 1:43 PM, Ludovic Claude <ludovic.claude54@gmail.com>
wrote:

> Hello,
>
> I have a repository of files relatively well organised and containing a
> mix of medical images and csv files produced from those images in a
> neuroscience lab.
>
> The csv files contain some interesting data that I would like to aggregate
> with Drill, but the naming convention is quite special - file names contain
> some id, then a prefix or suffix to identify the category of the file and
> all that is nested into a folder structure organised by subjects, for
> example ID1/processing1/ID1-mx.csv.
>
> How can I use Drill to filter out the files that I do not need and keep
> only the files containing my data?
>
> For example, I would like to write something like
>
> SELECT * FROM dfs.data.`/` where dir1 = "processing1" and file like
> "%-mx.csv";
>
>
> Thanks
>
>
>
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message