How Google Analytics data is Integrated - the “Attribution Source Hierarchy”
When integrating data, we focus on Best-Signal Selection. SourceMedium uses Shopify as the robust transactional foundation (Source of Truth for “what was sold”), but we actively enhance that record with attribution signals from Google Analytics and other platforms. Rather than just “trusting” one over the other, we intelligently merge them. We prioritize the most specific and reliable signal for each order—this is our Attribution Source Hierarchy. In short, we report an aggregated dataset where Shopify’s financial records are enriched by GA’s behavioral data. For the full priority order, see Attribution Source Hierarchy. This enrichment can fill in gaps in both data sources — SourceMedium can regularly provide attribution for 10-30% more orders than Google Analytics on its own.Why doesn’t Shopify/SourceMedium match Google Analytics?
Google Analytics has some common failure points that can cause data sources to diverge:- Subscription Checkout Issues Subscription services like ReCharge often do not correctly attribute orders. This can happen for a number of reasons, and the solutions generally depend on the subscription service.
- Adblockers The data that GA provides is generally only as good as the tracking, and most, if not all tracking tech (pixels, cookies, tags, etc.) can be circumvented by customers.
- Faulty Tracking There’s no absolute right or wrong approach for setting up UTMs, but most companies make some sort of mistake when setting up tracking. Start with UTM Setup, then follow Improving Last-Click Attribution and this template.
- Factors not visible to SourceMedium Tracking is complex, and many other factors that are not visible to SourceMedium can come into play. It is generally reasonable to expect 10-20% discrepancies between Shopify (source of truth) and GA.
Intelligent Data Aggregation
SourceMedium doesn’t just rely on one method. Instead, we aggregate first-party data captured from all sources and intelligently merge it to create the highest quality record. This means:- More Data Sources = Better Output: Connecting more platforms gives SourceMedium more signals to resolve identity and attribution.
- Better Capture = Better Output: Improvements to your upstream tracking (e.g., a robust GA4 setup or server-side tracking) directly improve SourceMedium’s accuracy.
What do “Direct / None” & “None / None “ Source / Mediums mean?
(Direct) / (none) is the bucket we put things in when we don’t have access to attribution data. It means either people visited your website directly (manually typing in the URL) or something outlined above dropped attribution.
(None) / (none) traffic refers to visitors whose UTM source and medium values are both null and therefore cannot be identified by Google Analytics.
If you’re seeing one these source / mediums as your largest attribution category, usually it means something can be improved!

