I am currently looking at a position that would have me leaving a traditional on-prem DW(teradata) to build a smaller scale cloud based solution. I’ve seen plenty of articles on the system architecture, but very little on data architecture. I’m specifically curious on practices for moving from dimensional tables(I’m assuming most joins are done on natural keys)? Also on what etl and handling incremental changes look like in this environment. Examples being say appending a day’s worth of sales data to an existing panoply table or processing address changes to customers on a regular basis. Can anyone point me in a direction to where I could learn on this and start feeling out my level of comfort in the move?
Please sign in to leave a comment.