00:00:00.760
all right second start uh so yeah let me know if I need to slow down uh this is going to be very formal
00:00:06.319
um will this work it will
00:00:20.160
work so like if you have if you have questions if you have context I don't want to stand here for like 40 minutes
00:00:26.400
talking at you guys so please just butt in raise your hand ask questions it's going to be make it much more much less
00:00:32.040
awkward for me um so anyway we're going to talk about views today uh what you'll hopefully learn so like what are DB
00:00:37.960
views uh when are they useful how to use them with rails I'm going to try to show you the maximum amount of code so I'll
00:00:43.200
hop into like my editor after this I've got a repo you can follow along with uh just to give you like practical examples
00:00:49.879
uh but and my goal is really not to be have have this be like a tutorial um because odds are you probably don't have
00:00:56.559
this need like this need right now but hopefully if you do have similar needs in the in the future you'll remember
00:01:02.079
like you remember where to look or at least I'll have put some ideas in your head for different options that you might have um so like why are we talking about
00:01:10.040
views um so in I guess like as anyy mentioned
00:01:15.439
like I've been in an out of rails apps for my entire career for better for worse um rails really nudges you to not
00:01:21.280
lean on your database too heavily and like most of my Approach throughout my career has been to like put stuff in
00:01:26.560
like basically indexes and tables and not do too much else like functions or like anything else is just like don't do
00:01:32.720
that because you know you want to support multiple databases or whatever but this is a trick that we've been using at work I'll work walk you through
00:01:38.600
some examples of like uh like concrete examples that we've implemented that like really helped us out uh so like
00:01:43.640
good tool to have a new toolbox is a thing I wished I knew like earlier in my career so hopefully I can short cut that
00:01:48.759
learning for you and give you some context there um about me I don't like having
00:01:53.880
these slides but for a bit of an anecdote uh my first conference talk was in 2015 at K Fu in the Ruby track yes
00:01:59.640
there was a ruby track at K Fu uh in them hidden under the sea of PHP uh but
00:02:04.719
one of the pieces of feedback that I got scribbled on a back of a on a note that I got given after this was like
00:02:09.840
basically uh like don't disclaim your lack of expertise I guess is the summary of this the nice way to put this
00:02:15.680
feedback so uh yeah like like Andy was saying I've been doing ra doing Ruby
00:02:21.440
since 2009 I started in like rails 2 uh I don't know if you remember like rake
00:02:26.800
gems install that was a lovely time um and I currently work at a climate
00:02:32.640
startup called patch this is not a hiring this is not a recruiting pitch because we're not hiring right now but
00:02:39.360
uh what's this is relevant though because uh for one like it's a rail shop two we're a small we're a small company
00:02:45.920
of like 12 engineers and our challenges aren't scale necessarily our challenges are making the sense of a very complex
00:02:51.440
domain and making the most of like the people we have rather than spending time like optimizing the hell out of
00:02:56.640
everything because know requ in terms of like request for for second or per minute like our app is very quiet at
00:03:02.519
night let's just put it that way uh so we're really optimizing for like ease of maintainability and that kind of thing
00:03:08.280
um so yeah back to the subject at hand quick disclaimer uh I'm going to be talking about postc a lot bunch of these
00:03:14.519
examples apply to other uh relational databases I have them listed here but like um the the gem I'll be using
00:03:22.280
supports like my SQL esite SQL server and Oracle I've have bad history with
00:03:27.920
three of those so maybe esite but but anyway you can apply this to other databases um and yeah so like for the
00:03:35.200
next like 10 15 minutes I'm going to be talking through some code uh on that repo you have the history and the code
00:03:41.080
that I've been been I'm going to be walking you through it's going to be awkward I'm just sit here anyway um there's also a link to the slides so any
00:03:47.120
links that you have that are on the slides you can open later so there's a reference there um so yeah I'm going to
00:03:52.680
attempt to switch to mirroring displays so I can do this not terribly and we'll see how that
00:03:58.560
goes uh that works all
00:04:06.400
right is it going to complain at me all right let's go back to the
00:04:21.440
beginning it works okay cool um I'm going to show you a base schema just cuz so you know what I'm so like these
00:04:27.840
examples aren't totally uh oh sorry can everyone read do I need to make my
00:04:32.919
bigger font sorry uh where the heck is this here we go
00:04:39.400
17 I don't know I've been using uh Ruby mine because of TI's recommendation for
00:04:44.680
like a week now so excuse me is that big enough or do I need to go further good okay uh so yeah a Bas schema uh just
00:04:51.800
because I'm going to be go through some examples it's usedful to have something semic concrete uh so I've basically gone with something like e-commerce e cuz a
00:04:58.639
bunch of us were at chopy and that's an easy thing for people to wrap head their heads around basically I have listings so listing would be like I have a
00:05:05.360
festival t-shirt uh that listing contains several products so products have like an inventory price um and they
00:05:12.919
have they belong to a listing and then these products have properties so I have a properties table kind of like a tags
00:05:18.240
table these are just key values uh and I have a product properties that's a many to many relation from products to uh
00:05:25.520
properties uh I also have some seeds um so I'm basically creating small
00:05:31.720
medium large chares of three t-shirts called red green and blue I will show you this in a second so let me just
00:05:42.280
migrate and I will
00:05:47.919
seed and you'll see so I have a bunch of models here uh these like basically exactly what you would expect nothing
00:05:53.080
too surprising uh I did add though uh sorry that's my model if I go
00:05:58.720
to my next commit you see I've added an overview so let's
00:06:04.639
let me just show you that quickly so you know what data
00:06:11.919
exists and that's basically what I was talking about so this is the data we have in our database uh I have a red t-shirt listing that's listing one it's
00:06:19.400
like and these are the the tags I was talking about so color red size small and so on so forth um and yeah that's
00:06:26.479
basically it uh like I said questions always good all right let's hop to the next
00:06:32.240
commit um so imagine we have that schema and we want to show you uh let's well so
00:06:38.520
we had like an inventory on the product we had uh a bunch of properties we want to show you instock products uh with the
00:06:45.560
of a certain color for example uh so you might implement this uh here I added two methods here uh so I'll show you quickly
00:06:52.840
what these are doing so I added an instock scope there's different ways to do this this is the first thing that came to mind but I have an instock scope
00:06:59.599
this basically looks for inst stock listings and then does like an inner join this is a relatively new rails
00:07:04.960
feature uh where you can do a Common Table Expressions uh but basically makes this query available inside of this
00:07:11.360
query and so I'm taking all of my listings and then inter jooin on my on this subquery to get the instock
00:07:17.520
listings and then I did similar approach with with property um same thing basically find the relevant properties
00:07:24.120
do do an in join on the listings and so now I can combine these in interesting ways so if I wanted to say
00:07:30.199
uh give me all the products that are blue uh yeah give me all the listings that uh so like in
00:07:37.280
stock did I not yeah sorry typing while presenting not my
00:07:42.560
forte thanks Charles uh yeah so those are my three in stock listings I can and then I can
00:07:48.639
chain these I can do with property uh let's find a
00:07:55.639
property value it's gone off the screen
00:08:01.599
blue and then I've got my blue t-shirt which is in stock uh I I bring this up because this is the the kind of the this
00:08:08.759
your product manager tends to come with you and says like what if we had a search experience and like we're just going to do sizes and colors it's going
00:08:14.560
to be great and then this inevitably becomes like a giant mess um and eventually you want to add more
00:08:19.960
attributes and that means like every time you maintain this code you have to you have to think about like all these inner joins there's different ways to
00:08:25.840
implement this but obviously like you could see like every time I add a new like mentioned to filter my like listings on I end up with more this I
00:08:34.399
end up like accumulating this giant like s KY that's doing like all these joins or like manipulations and then you have
00:08:39.680
like you have a hard time reasoning about cardinality and things like that um so one thing you can do to like
00:08:46.040
potentially solve this which is where like views come in uh what did I do oh I'm in a console thank you
00:08:53.279
John uh so first thing I'm going to be using a gem called Scenic uh I won't show you a gem file because it's a gem
00:08:58.959
file and you probably seen that before but if I where's my Firefox if I open
00:09:05.320
Firefox this used to be part of thoughtbot got spun off into its own org but it's a very well-maintained library
00:09:12.200
that lets you deal with uh man like manage database views from your rails app you basically run this so I've
00:09:18.120
already done this in the next commit but I generate a scenic view and then you give it a name so if I hop over to my
00:09:24.720
next commit uh and I show you
00:09:30.680
show RS so I will I created a new so I have
00:09:37.560
so it generated first it generates a migration for you and this looks very empty but uh you'll see basically
00:09:43.839
creating a view named listing search results and then it also creates a listing search results do so you get
00:09:50.920
this new folder here in DB called DB views and that basically has all your uh
00:09:57.279
the name of your your view and the version and so I've created a a new view here um
00:10:02.480
and basically this is a big SQL quy uh I'm doing similar things that I was doing earlier in the the model uh but
00:10:08.160
basically I put it like basically dumped it out here and now if I DB
00:10:15.640
migrate look at that and I open my database viewer and reload I'm in the
00:10:21.760
wrong database here we go you'll see this blue table here um
00:10:27.079
and this blue table is basically the output of that query um if I look at the I probably make this bigger put the people at the back this
00:10:33.839
actually works with the thing you hope um if I look at the definition here so this is a view and this is the exact SQL
00:10:40.880
that I dumped into that v01 but I can interact with it from from the from a
00:10:46.160
query perspective uh I can do like select from listing complete okay I can
00:10:53.839
interact with it like a view basically uh sorry I can interact with it like a table but it's underneath it's just a
00:10:59.000
it's just an SQL query um but it lets me hide all this complexity behind this and pretend that like I have a
00:11:05.600
nice I have a nice condensed like one row per result kind of solution so I just have a bullion for in stock and I
00:11:11.399
have uh my property IDs for each listing uh I have anything else in
00:11:18.839
here oh right yes so the other thing you can do uh because it looks like a table is I can do a listing search results so
00:11:26.040
I added this model and this from a raist perspective is just a regular old model
00:11:31.320
it behaves exactly the same way uh so it's back to you can have relations so like because I have a listing ID on my
00:11:38.440
in the output of my quc and just have have it be your relation this here is an optim they recommend you do this you
00:11:43.560
don't really have to but it prevents you from trying to save on it because it's a view and like you'll probably get a postgress error if you do anyway so this
00:11:49.639
lets you fail early um but here you'll see that I've now implemented what I had
00:11:55.160
before on listings in a much simpler way where now I can just say okay well uh if
00:12:01.120
I want in stock items I could just do in stock true if I want with a property I can just do like an array index up here
00:12:07.720
and this so like this is probably one of the this is where like I started using like views a lot more heavily in that
00:12:13.360
like we had a page like this that was like faceted searching and we had a lot of logic for these kind this kinds of
00:12:19.519
filtering and it's a lot easier to reason about when you're just depending where is to existing columns that way you don't have to think about where the
00:12:25.160
data came from like you can review and test and make sure that like your your is correct but then in your like actual
00:12:30.720
creating logic you can simplify that uh quite a bit um so sorry that's all that was the end of
00:12:37.040
my sentence uh let's go to next so the other thing you can do um in
00:12:43.440
the so like the other thing cexi do is obviously like your your view is not set in stone and you might want to evolve it
00:12:48.839
over time uh so if I open this I basically if you rerun the same uh
00:12:54.160
generate commands so like the rails generate scenic view with the exact same name it'll do an update for you so if I
00:13:00.519
look at the migrations I'll have a I'll have a new migration here uh this does update view you'll see it's the same
00:13:06.760
name but now I have a version two and I can revert to version one and that basically I made some changes in here so
00:13:13.079
instead of I added Min price and Max price because for every listing there could be multiple products uh if I then
00:13:18.959
migrate this I'll get my second version added to my database and so if I
00:13:25.079
look at here if I reload this and I reexecute this what did I
00:13:31.600
do let's look at the content there we go uh so now I have my mint price and my Max price and I was able to like evolve
00:13:37.880
my schemo over time by just adding additional versions to my to My Views and again these just act like models so
00:13:44.440
my I'm using the annotate gem but you'll see the prices show up here uh and from the rest of my app it looks like a
00:13:50.759
regular old model um the other thing I can do so so far you've seen me create uh like a non-m materialized view uh so
00:13:57.800
that basically behaves like a query uh it just look like from you can create like a table but under the hood it's just a subselect uh if you look at your
00:14:04.199
query plan here you do an explain you will get basically the output of that query uh underneath your like whatever
00:14:09.720
you're injecting at the top but what you can so the other type of view that you can do is called a materialized view
00:14:14.800
which is where you basically tell your database to put the output of your view into an actual table and so it'll just
00:14:21.759
create like it'll execute whatever Creer you give it and then store it the exact same way as any other table this is actually really straightforward to
00:14:27.639
switch to so like this is another thing we've done as well is we'll start out with a regular view eventually like from
00:14:33.000
a query perspective like we want it to behave more like a table so we'll switch it to a materialized view there's
00:14:38.360
different trade-offs I'll go into but I'll just show you how that's done uh so here and then if I look at there's a new
00:14:45.560
migration that I created uh this is a bit more involved cuz uh switching for there's no like one method thing to
00:14:52.880
switch to a materialized view but if you look at I mean this is pretty straightforward so I'm dropping my existing View and I'm creating the exact
00:14:59.360
same version so at this point I was at version two I'm just recreating version two but I'm adding this uh this
00:15:04.800
materialized true at the end uh the one and like a huge benefit of materialize use is you can add indexes so you're
00:15:10.839
seeing me add a listing ID unique index here I'll get into why that's important and I'm also adding a property IDs which
00:15:17.880
was in my array of properties uh because you can do cool things in postgress like there's the G type index which is the
00:15:23.399
generalized inverted index I want to say uh but you can speed up like array lookups and that kind of thing using
00:15:29.079
different types of indexes and post you could do full text search and that kind of thing here um so that basically switches the so yeah this migration
00:15:35.319
basically switches the table type if I then migrate and go back to my
00:15:43.880
database you'll see it switches well I don't know if you can tell with the color but it switched from blue to green in postico which is helpful but
00:15:50.319
basically the same thing but now if I look at the structure it says a materialized view and I'm getting the
00:15:55.880
same data but now this data is on disk and I have indexes on uh so I have an list so like if I were
00:16:01.480
to crey uh by by listing ID I'd be using this unique index here or if I'd be cing on the property IDs I could use that
00:16:07.600
index there um the reason I mentioned the uh the unique index on property ID
00:16:14.319
sorry the listing ID is if I go back to my listing search results um so the way how do I undo that
00:16:21.160
there we go um so the way materialized views work work is they're a onetime um
00:16:26.319
creation so you have to keep them up to date and the way you would do that so scenic also gives you like a helper here
00:16:31.560
so you can do basically what I'm doing exactly here uh so you basically refresh the materialized view that basically
00:16:37.399
tells postgress to re-execute your query and update the table um and the what you
00:16:43.440
want to be doing so like this is why I added the unique index is you can is you want you want to be using the concurrently uh option because if you
00:16:50.399
don't do that it'll lock the entire table which is generally not great for your app uh but if you have a unique index that covers the table uh it can do
00:16:57.560
this concurrently so yeah you're still executing quy but you can still send R traffic to the old to like the the
00:17:03.000
previous data and this will continue to work fine um I can show you how well I mean it's calling a method
00:17:10.480
but uh let's see I mean there should be some postgress output which might be so listing search
00:17:17.720
results refresh and yeah this is what it's executing it took 14.2 milliseconds cool
00:17:26.120
uh we do this uh so I'll I'll when when I talk into like the specific examples like we do this Inon uh some tables it's
00:17:32.799
every 5 minutes some tables it's once a day really depends on like your tolerance for stale data for use cases like I just showed you where you're like
00:17:39.360
you're basically generating like a search document and you want to like filter over of it usually people are more tolerant to something being out of
00:17:45.080
date for five minutes so or even more so like up to you how you want to do this so like it is a trade-off the benefit is
00:17:50.600
that like you're dealing with a table versus dealing with like having to fetch all the stuff that your Crea finds uh let's see what's next I think
00:17:58.919
that's all of it uh what did I do here I added slides okay so I can go back to my other one any questions before I switch
00:18:04.600
back to my slides stupid question um what's JJ next me oh
00:18:10.480
uh I we can talk after but the short version is uh it's a like it's a experimental Version Control System by a
00:18:17.280
guy Google uh it pretends to be get under the hood so you can just use it while your colleagues are totally
00:18:22.440
unaware and it has a lot of cool features but we can talk after that could be a talk in itself but yeah um I
00:18:28.240
have a question is like matal view you cannot keep them up to
00:18:33.480
date by themselves you have toally Refresh on I thought you could uh but po
00:18:39.280
would be a when you insert from the in the source table to just so there's there's uh there's no first party option
00:18:47.000
there's been like on the post mailing list they've like talked about this and there's been like various like academic
00:18:52.320
research on like how to do it efficiently so there is interest in this uh I link to the wiki later and there's
00:18:57.640
also uh a like third party extension that does this the downside is like we use RDS and like right sorry but but
00:19:05.480
there are and do exactly what you say where they look at your care and then put hooks on the right tables denization so you're paying more for your rights
00:19:12.200
then your your so you can there are options to make it uh to make it either I think in that extension you can do it
00:19:18.600
like in the transaction or like after the commit so there are options but if you're using out of the box postgress
00:19:23.880
you have to manually refresh your your views you could use triggers
00:19:28.960
yeah but then you you're gonna spend a long time on it it's probably gonna
00:19:34.720
be yeah the the reason I think the big benefit is that like you're using a thing that's out of the box you don't
00:19:39.840
have to write any synchronization codes you don't have to maintain triggers it's just like a query which has a lot of
00:19:44.919
benefits for like small startups uh if you don't want to uh yeah if you don't want to maintain all that infrastructure
00:19:50.840
uh I'm attempt to switch to back to my slides if that
00:19:55.919
works and just just to clarify when you have a view so it's running that if it's not a materialized view it's running
00:20:02.039
that sequel on every single query to filter correct correct yeah basically if you were to rewrite it where you did
00:20:09.039
like a did your career and then from and then in a subselect you pasted the the view definition that's the performance
00:20:15.080
that you would get basically for that VI um even though it's a native feature
00:20:20.840
yeah so you get some benefit in the sense that like I think the query planner is a bit smarter and because it knows like you you it's like it acts
00:20:27.200
like a prepared like like a prepared Cy basically because it's in the database but like beyond that if you're if you
00:20:33.520
need a mental model you have to think about it as like a full like you're not saving anything what you're getting is like abstraction um which can be useful
00:20:41.360
for like we use it for like analytical queries uh like if I want to give finance a way to like not have to think
00:20:48.919
about our like our like our application modeling but like like generate some report like I'm running out on a replica
00:20:54.840
I don't care how long it's going to take but like they just have a nice table that like I don't have to maintain um so that's a bit like good use case for that
00:21:01.679
let me find my slides which have
00:21:06.880
disappeared will they come back hey look at that all right um all right to recap uh so yeah views
00:21:15.760
like I said view views are way to turn an SQL query into something that quacks like a table like we were just discussing they behave like a subquery
00:21:21.760
from a per performance perspective and you can put a standard model in front of them you can also test them the same way so if you if you're using something like
00:21:28.120
Factory bot you can like create the like the records that would have been pulled into that query and like basically write
00:21:34.120
a standard like a standard test which is really useful if you want to like make something maintainable that you know is
00:21:39.400
correct then other parts of your code can use and like we saw uh you can use it with a Scenic gem which is by far the
00:21:44.840
easiest way to do it in rails although you can put like raw SQL in your in your like migrations if you want but not not
00:21:51.120
really no no good reasons to do that uh materialized views same so similar
00:21:56.159
concept but like they're written to D they behave like real table uh cuz you are actually like storing bites on disk
00:22:01.480
they can be indexed but they need periodic refreshing and like I said they can be refreshed concurrently if you
00:22:06.840
have a unique index but like I would consider this a must because in other yeah if you're not doing that then like
00:22:14.039
I don't like unless your app is basically doing nothing that becomes a b a bit problematic um so I mentioned I
00:22:19.240
would talk about a few examples from our from our actual app because I think that's more useful here um the first one
00:22:25.640
is we have for legacy reasons uh we actually have like two versions of
00:22:32.159
like an object that represents a sale to like a human one is called sales the other one's called orders of course at
00:22:38.120
some point we wanted to show like a cohesive list to user because that seems like a basic requirement so we basically
00:22:44.159
have this thing called unified sales uh you in so in SQL you can just Union all two tables you can index them so we have
00:22:50.240
a materialized view called unified sales and that's basically the main listing uh of sales uh for our for customers the
00:22:57.240
advantage there is because it's a real table you can paginate it you can index it uh so like the performance is the performance is a lot more predictable
00:23:03.799
and from a data freshness perspective like so far we haven't like most like these two models are being written to
00:23:10.159
like these they evolve like human processes so they're not being written to that often for a while we just like if you did update a sale we regenerate
00:23:16.279
The View but now we just run it every 5 minutes not a big deal but really helped us avoid having to write like a really
00:23:22.600
expensive migration to like a unified model someone spent an afternoon writing this query we like tested it and and
00:23:28.559
then now we have a model called unified sales that we can use to like pretend that everything is fine and so far so
00:23:34.600
good um second example so this is the other examples I talked to you about
00:23:39.760
although this is real things that we pull into our actual like search records um you could be so like the reason I
00:23:46.279
brought up the context that like we're a small team we're trying to use like the tools we have like in like my like
00:23:52.559
default to some solving this kind of problem would be like to index into like elastic search or something that does like multi-dimensional like like quing
00:23:59.799
but like that's yet another system like you have to deal with like batching for my for like or like on dev setups they
00:24:05.480
have to have like elastics installed in AWS you need a new service in this case we basically like pulled everything that
00:24:12.000
we could possibly want like again this was me figuring out like okay you're going to want to filter by like these things and like writing a big query that
00:24:18.320
like pulled all of those all of that data into like one big table and that basically let us like Express like the
00:24:24.799
previous version of this feature several very smart people basically abandoned trying to make changes to it because we
00:24:30.399
couldn't make heads or taals of like what exactly you filtering and are your like results correct whereas with this
00:24:35.880
approach we were able to validate that the query was correct and then we basically expressed the filtering logic as wees on like single rows which is a
00:24:43.559
lot easier to reason about and a lot easier for people to like iterate on um the last example I'll talk about
00:24:49.919
uh is so providing like I mentioned to someone earlier but like providing useful abstractions uh often times like
00:24:56.799
you'll have different parts of your app do different things uh but like being a rail app like everyone has access to all
00:25:02.320
the tables and maybe you don't want to like uh split like have the same logic about like reasoning about like one
00:25:08.200
domain in another domain so having like a view basically pretend to be a nice version to like mask a bunch of like
00:25:14.880
details behind like uh part of the app is very useful for a concrete example
00:25:20.039
like we model um we model the like so uh we don't do it anymore but we used to have a product for to help uh carbon
00:25:27.240
credit suppliers basically manage the life cycle of their projects the way you think about inventory as a carbon credit
00:25:32.880
supplier is like so carbon credits are issued per year you kind of choose what an issuance is is is basically you going
00:25:39.080
to like a registry and saying like I worked on the like this is my data this is how long I I measured for Can you
00:25:45.000
issue carbon credits for me and then they get credits for every vintage there's also the notion of like well
00:25:50.799
some of this is forecast that's issued those are different things from an inventory perspective we and like in so
00:25:57.399
in the part of the app where suppliers were interacting with their inventory like we want this level of detail because like that's how they're thinking
00:26:03.039
about it but in a dashboard in the other part of the app or if we were like purely from a sales perspective like you're just thinking about like you're
00:26:09.520
not thinking about inventory you're thinking about like what what is available for sale like you don't want to deal with like all of this and what H
00:26:15.480
what ended up happening in our app was the logic to reason about this and to basically get aggregate numbers per year
00:26:21.520
ended up like being replicated in like three four different places sometimes incorrectly and that would become
00:26:27.200
becomes a major headache cuz you don't really find out about it until like someone's on the phone with you being like well on this screen it says that
00:26:32.760
and on that screen it says this number like what gives and you spend like a bad afternoon so our solution to this is we
00:26:38.679
have this is one example but we have a lot of these throughout the app where we take all of this mess and create a thing
00:26:44.080
called project vintage inventories which gives you like a project a vintage the number of issued credits the number of
00:26:49.320
unissued credits and that way like the rest of the app can interact with this whole inventory tracking system as like
00:26:55.880
a nice table with like exactly what you need and you don't have to think about the underlying details also a useful way
00:27:01.279
to do refactors uh if you want to like change the backing of like some of this you don't you can like decouple parts of
00:27:07.120
the app by having a view and then you can just change where the viewpoints uh over time uh this has happened several
00:27:12.159
points in our in our like inventory search records where we were able to evolve the app underneath it and we
00:27:18.200
basically updated the view and then the rest of the app doesn't have to care um so like in Broad Strokes great tool to
00:27:25.000
dry up your code base um and basically hide a bunch of complex like logic behind like a nice like abstraction um
00:27:32.279
and they can like and then the second point which I was saying about like smaller startups and stuff like that is you can really like you can you can
00:27:39.279
basically get yourself more Runway or like more time before you need to throw in like big tools or synchronization or
00:27:45.120
like elastic search or like other etls you can basically use what you ever already have to create a search document
00:27:51.679
or basically take the data you have transform in some other way and like you can get a lot of like use out of this
00:27:57.720
kind of thing um if you want a real real example I won't show it to you but mcadon actually
00:28:02.760
uses Scenic for recommendations and stuff like that uh if you click on that link in the slides in the thing uh
00:28:08.720
they'll take you to their actual repo they have the exact same setup uh they use it I think the the example I linked to is they use it to like rank um like
00:28:16.399
follow recommendations so they basically run it once a day you have follow recommendations and they're able to like query that table to say like give me
00:28:22.399
three IDs to show in some page and like the code that shows the IDS doesn't have to care and
00:28:28.640
like those of you who followed the growth of madon will probably know like ma like well for me I just like
00:28:33.720
witnessed the world learn how to like maintain rails apps which was like funny but also painful uh and so the more they
00:28:40.120
can do to like use fewer things I think the better uh cuz not everyone's like the biggest instances and a lot of
00:28:45.399
people are like it's just a lot to handle um so what are the trade-offs I mentioned some of these but it's worth
00:28:51.559
like like repeating so um so they're only yeah like the regular views like
00:28:56.880
non materialized views they're only as performant as the careers that Define them and so and like basically you're
00:29:02.399
still at the at the whims of the planner the career planner so you your solution here is to like I mean measure this
00:29:07.799
might work out for like an analytical career where you don't care about the response time if you're using this on a
00:29:13.279
on a like performance critical part of your app if your underlying Carey is super simple and can just be like load
00:29:18.760
from an index then then your view is going to be fast it really depends on what you're doing with it um it can also make it super easy to like paper over
00:29:25.480
mountains of complexity uh like in some cases which thankfully we've cleaned up since we had views calling views calling
00:29:30.640
views and then your like query plans look like an Alexander Calder mobile points for art references but this is
00:29:37.840
literally what like data dog would look like for for some of these things and then I had to go like be the defender of
00:29:43.159
the database and be like what the heck are you doing uh but it's a very because it looks like a table people don't think twice about cing it and like you just
00:29:49.519
have to know it's a spicy table and like sometimes it can be really slow um really but like it's it's it's an
00:29:55.159
abstraction sometimes that abstraction comes to bite you and then for materialized views a lot so
00:30:00.960
they're a lot easier to reason about for reads because you're dealing with a real table uh but yeah then you're like
00:30:06.000
scaling factor is the is refreshes so like because you're doing this whole Kore and like rewriting a whole table
00:30:11.440
like that's load on your database um if that's like if there's not a ton of load you have headro that's not a problem
00:30:17.120
eventually if you're dealing with like you know tens of millions of rows this becomes taxing that might affect like
00:30:22.480
you know take resources away from like other queries so something to bear in mind um the last thing is like like I
00:30:27.720
mentioned stale data so you have to refresh your your data and to J's question like PGM is the third party um
00:30:34.559
is the third party like plugin that will let you do uh incremental view what is m
00:30:39.760
i don't know but that's what it stands for and I linked to the wiki on the like the postest Wiki of like talks of
00:30:45.200
bringing something like this into the database uh but at the moment if you're using box standard postest you'll
00:30:50.279
basically have to run your like Refresh on a cron um so tradeoffs I just want to
00:30:55.799
touch on like performance because like this is something I to unlearn going from like big company to small company
00:31:00.919
is that you're not Shopify so like if your views a bit slow like your database
00:31:06.360
is probably fine and like you have a lot more leei than you think uh and like humans are exp to humans are expensive
00:31:12.440
so like if you want to spend time like writing synchronization logic and spinning up like new infrastructure that's a lot of overhead if you can get
00:31:18.360
away with things like views with like what you already have and it's not it like it will buy you time to figure out
00:31:24.000
in our case to figure out like product Market fit uh and then like people in 5 years can like who like in a company
00:31:30.559
that's making like hopefully making like way more money can like fix this and like build other systems but at least
00:31:35.799
like they'll have a nice query and a nice model that they can go off of and it'll be easier to understand so uh but
00:31:41.120
yeah and then lastly like measure measure measure I have a extra slide just to show you a tool that I I really like but like all of this is they're all
00:31:48.120
trade-offs there's no like easy solution I'm not going to pit you this is like the like solving everything like create
00:31:53.919
like they're cheap to create so like dump one on your production database if you have a replica or if you have a readon connection see what it does like
00:32:00.639
measure there's probably indexes you can add there probably optimizations you can do I can't give you a solution but ultimately like your success here is
00:32:06.880
going to depend on your use cases and like the performance of it will depend on like what ques you're running and all of that um some annoyances to keep in
00:32:14.720
mind again not a tutorial but like if I say this to you you'll be like if you ever end up using this this will like uh
00:32:21.440
flip a light switch in your head somewhere and you be like oh this is a problem it's annoying um so the first
00:32:26.720
one is uh yes you can't if you're changing a view that depends on other views you have to drop the other views
00:32:32.120
first create your new view recreate the other views this is more of an annoyance uh you can do this in your
00:32:37.559
migration not a super big deal but something to be be aware of if you are evolving your your view
00:32:43.519
schemas um the second one a bit of a bigger deal like if you're dealing if you're working on teams but the way scen
00:32:50.120
is set up uh you basically create your table but there's nothing and it like applies review definition to your table
00:32:55.919
um but there's nothing in the table itself to tell you like what version you're running and this becomes a bit of a problem if you're like if you and your
00:33:02.480
colleague are working on like iterations of that view at the same time you could end up with like two of the same version
00:33:09.120
and like migrations that like and like your merge conflict won't make any sense because you're making like different
00:33:14.200
things to like a new version and like and you're so basically what you end up having to do is like accept that someone
00:33:19.600
is like going to go first and then rewrite your changes on top of their changes so that there's a nice like serial 1 2 3 4 and you can apply your
00:33:26.279
migrations you can back track based on the like the rails like version number so that it'll tell you like what was run
00:33:32.679
uh but in in terms of like development flows and like and Version Control uh this is something to bear in mind as
00:33:39.159
well um lastly yeah so when you create your view post will freeze the like
00:33:44.760
schema in time and so if you change a backing table um it like say you add a
00:33:49.799
column to a backing table and you're doing like a select star in your view that new column is not going to appear until you drop and recreate the view um
00:33:56.399
something to bear in mind cuz it could be a bit frustrating so additional a bit of additional complexity you have to be
00:34:01.519
be aware of if you're like relying on these pretty heavily um I'm going to stop there I have a couple extra extra
00:34:07.600
slides but before I switch to that any any questions comments can we see your
00:34:13.359
schema which one the the one from your examples uh sure it's in the repo I
00:34:19.560
don't know if I can uh let me see uh it's here
00:34:28.159
and then I can just show you I think this will work is it going to work come on
00:34:33.679
GitHub ah crap I'm not online all right uh let me do you mind if I go back to it
00:34:39.960
afterwards it's in like isn't it there in oh it's here sorry my slides R away yeah there you go where's my if I can
00:34:48.000
find my mouse I swear uh DB schema oh yeah so one thing I didn't
00:34:54.639
mention I can move this away go uh so yeah one thing I didn't mention
00:34:59.680
this ends up being appearing as like create view in your schema uh basically um Scenic adds these methods to like the
00:35:06.160
schema definition and so these just end up in your schema and you get the full definition here yeah to me that's one of
00:35:11.800
the annoyances like having that in my schema I mean how often do you read your
00:35:17.280
schema no I don't just having that shown and I don't know it's just a I
00:35:23.520
would rather have it get there and reference in the schema it's
00:35:28.640
of yeah it's the full view right yeah it's full view yeah it's a full query that you're doing I mean what we do is
00:35:34.720
like so there yeah and that that could be an annoyance especially because at work uh we run post 12 in production but
00:35:40.800
everyone there's basically in development environments there's a sliding scale of when they ran like burstall postrest when they like joined
00:35:47.040
the company and like the the like output of the schema definition is different
00:35:52.760
subtly different but still compatible between all the postest versions so every time you run a migration someone just like like creates a giant diff for
00:35:59.200
nothing and like that's really annoying but uh one thing we do to like cuz I don't stare at my schema that much but
00:36:05.119
like you you use the like annotate gem uh you just get these nice comments at the top so you don't have to care like
00:36:10.680
this is most like once you have this like most of us use that as a reference for like what the heck is my table
00:36:15.800
rather than like the schema itself cuz yeah it's like like you said giant giant pile of SQL so not that
00:36:22.480
useful uh any other questions um do you know if refreshing materialized view happens in a locking
00:36:28.560
transaction like what happens if you're trying to read that view as it gets if you're trying to read it if you're
00:36:34.119
reading it snapshot change all as once or if you're so if you're running it non-c concurrently it'll lock the whole
00:36:40.680
table uh so you're whatever you're doing is going to wait if you're run it concurrently it applies the same way as
00:36:46.440
like someone writing to the table row by row so you'll get whatever quy yeah you
00:36:51.640
could quer your Midway yeah in general I wouldn't recommend views for like like
00:36:58.040
if you're reading from it for a right because of if well sorry if it's a if it's a non-m materialized view then
00:37:03.599
whatever it's a query if it's a materialized view then you're going to have to deal with the fact that it's like stale and so if you're relying on
00:37:09.760
it to join in things and to like uh especially like if you're reading and writing inconsistent might
00:37:15.520
be different kind of fogam oh yeah if you're Midway through a refresh that also could also I guess you can do the
00:37:21.000
lhm thing where you create a new you could do that yeah a new materials you
00:37:26.079
yeah and like atomic right yeah this has yet to be a problem yeah like like I
00:37:33.000
said and like if this is if you're using this and you're running to that problem like maybe this isn't the right solution for you I think like our most of our
00:37:38.440
view passs are like the our read like our reads so this simplifies a lot of our read paths but if you're relying on it for for things like that like it
00:37:44.480
might be more of a problem yep um could you speak to using
00:37:49.960
views to solve really heavy joins and midsize rail apps so like for us we we
00:37:56.359
have like it's many records but we have hundreds of thousands of records and we have kind of
00:38:02.280
what you described earlier where it's like credits debits tons of stuff like that and we have a handful of queries
00:38:08.720
that just take forever to run they're very heavy um if you're in our shoes how
00:38:13.800
do you think about using use as a potential solution for that I mean the thing we do with uh if I go back to
00:38:21.599
here and I go back to where was it uh where are my examp here we go does
00:38:29.760
that do something here there we go yeah so this is actually AA similar case where like both of these are pretty like
00:38:36.440
the what we need to do to like obtain like uh unified sales is quite complex because all of these have like relations
00:38:43.000
in both directions um the trade-off you're paying for so like yes it can speed that up in the sense that like you
00:38:48.920
get a table at the end and that table is like on dis and can be indexed so you're you're basically reading as if like
00:38:54.720
you're reading from like a regular table with the tradeoff of like like potentially having like stale data cuz you do have to refresh it so in our case
00:39:01.880
like that was acceptable in this case but might not be a solution if you have uh if you if you need like Absolutely
00:39:07.640
Fresh data every time anyone
00:39:13.520
else cool uh so I got a couple extra slides and then I'm done I promise uh
00:39:18.880
yeah so bonus use case one extra slides something to be be uh be aware of but
00:39:24.839
you can use so uh in post I don't know if that's the case in other databases uh
00:39:29.880
simple views can you can write to them uh they have like you're allowed wees you're allowed different things but one
00:39:35.680
actually practical use case is if you need to rename a table and you don't want to take downtown you can create a
00:39:41.079
view with the well depends you can do it in both orders but like rename your table create a view nameing with the old name pointing to the to like the new
00:39:47.440
table and that will behave the same way as the other table they're just two names so long as you don't change your
00:39:52.680
like your schema in the time you're migrating but if you're changing the name I don't think you're changing your
00:39:58.079
schema as well but if you want to like like if you want a zero down time like table rename that's a really useful blog
00:40:04.720
post that explains this pattern but useful way to do it um the other thing I will show you is this repo which is oh I
00:40:13.400
don't have internet but I had it offline I think it's here uh but basically at this repo you will find a uh you will
00:40:18.920
find a pile of HTML uh that you can just download uh so I'm loading it from my uh
00:40:23.960
from my downloads folder here and this thing is a uh postgress explained
00:40:29.000
visualizer and so if you go to this is an old plan that I'm showing you but basically if you go to here you can you
00:40:36.359
basically slap that at the beginning of your query and you can and run that on postgress you'll get some Json out if
00:40:42.319
you paste the Json in you get a nice visualization like this this is a very
00:40:47.520
poor example because this whole Cy took less than a millisecond and so one thing
00:40:53.240
to bear in mind with this tool is everything is proportional so if it's a really fast query this looks really Terri CU it's very red but like you
00:40:59.280
shouldn't care about this cuz like overall it's like it's fast enough but if you're optimizing things like views or like optimizing you're like even like
00:41:06.040
in the case of materializer you're optimizing your your your like your definition uh like this thing is a life
00:41:13.079
is a godsend you can basically look at like by duration by number of rows this is really teeny but there's options over
00:41:18.359
here but then you can like zoom in and say like okay this is the query I use I think for the view but you can see like
00:41:23.680
what the heck it's doing and what order what indexes is using and basically see like from the databases perspective what
00:41:29.400
is it taking to like actually like return that query this is where my uh
00:41:35.880
Alexander like Calder like mobile like metaphor came back is it like I I've had
00:41:41.280
some that really like if I if I showed you the whole thing they wouldn't fit on the page so that's where like you can take these too far uh something to bear
00:41:47.920
in mind um and yeah that's it uh that's my whole talk uh I had a slide with my
00:41:53.720
socials but I don't don't care it's fine uh but anyway questions feedback