• +1 407-906-9790
  • info@convergencedata.com
convergence-data-logo
  • Our Solutions
    • SmartSwitch
    • Data Services
    • Competitive Intelligence / Smart Teardown
    • Product Information Management (PIM)
    • Product Lifecycle Management (PLM)
    • Digital Asset Management
  • Who We Serve
    • Industries
      • HVAC
    • Marketing
    • Engineering
    • Ecommerce
  • About
    • Events & Conferences
    • Partners
  • Resources
    • Blog
    • Resource Center
    • Classification Community
    • DFR University
    Contact Us
    Contact Us
    con_logo

    What can we help you with?

    Follow Us

    4 Critical Steps to Control Duplicate Parts in Your PLM

    Richard Turner
    Sep 28, 2022
    Classification, Duplicate Parts, Data Cleansing

    4 Critical Steps to Control Duplicate Parts in Your PLM

    Duplicate parts are the software bugs of the data world—they’re hard to identify without specific tools and processes in place. If these “bugs” are not found during implementation, your PLM setup will become extremely time-consuming and costly down the road. 

     

    To put your organization in the best position for a successful PLM migration, you’ll want to give yourself enough lead time (i.e. six to 12 months) prior to the implementation to find duplicate parts and formulate a plan to deal with them.

     

    To exterminate the duplicate data pest, follow these four critical steps.

     

    1. Collect and Classify the Data

    It is key to run any duplicate analysis off of a specific category of data—which is why your first step is to classify the parts. Start by gathering all the purchase part data together from each legacy system, including:

    • Manufacturing part numbers
    • Supplier names
    • Pricing
    • Existing internal part numbers
    • Descriptions
    • Any existing commodity codes

    Typically, data from different systems need to be classified into a single data model, which will help dictate the categories. This, in turn, drives the attributes and allowed values of each part.


    2. Enrich and Validate the Data

    Once all the data from each system is classified, it’s time to enrich the data. You can harvest attribute data for each part using the manufacturer part numbers and associated source documentation.

    image3-4

    The data is usually obtained from approved manufacturer websites or company-approved documentation (e.g specifications, drawings, etc.). The harvested data should be loaded into a database and validated against the data model.

     

    For example, when validating air conditioner motors, enriching each motor with key information such as horsepower, speed, number of poles, phases, and frequency allows you to group duplicate motors and similar motors more easily. In some cases, motors with different manufacturer part numbers can have very similar characteristics.

     

    3. Perform a Duplicate and Near Duplicate Analysis

    Once the data has been enriched, normalized and validated, perform a duplication analysis. When grouping similar parts to help find near duplicates, a clustering tool will be helpful. 

     

    At Convergence Data, we refer to this as “neighbor distance.” It allows different weighting factors to be set on the critical attributes and the overall cluster. To make this analysis work, you must have normalized data with separate units of measurement.

    Duplicate_Blog_chart_v2.jpg

    These parts have the same attribute values and therefore may be duplicates.


    4. Process Identified Duplicates

    Last, take the groups of duplicates and select one internal part number to be the master for each manufacturer part number (MPN). Since you will likely end up with multiple internal part numbers for an MPN, we recommend setting up a master cross reference—this will consolidate spend and rationalize your inventory without disrupting existing BOMs.  

     

    Additionally, it’s encouraged to manage the master cross reference of internal part numbers to the MPN in your ERP system. Over time, engineering can use the master internal part number for new products, which allows a slow phase-out of the other internal part numbers.

     

    Solve Your Classification Problems

    Rich attribution and classification of data are pivotal to preventing and identifying duplicate and near duplicate data that lies within your systems today. Sign up for our Classification Community today for more tips on controlling duplicates.

    Join Our Classification Community Group

      Posts by Tag

      • Classification (53)
      • Cleansing Data (43)
      • PIM (35)
      • PLM (35)
      • Duplicate Parts (32)
      • Cost Savings (28)
      • Ecommerce (27)
      • Convergence Data (26)
      • DFR (23)
      • Governance (20)
      • data normalization (20)
      • Data Cleansing (18)
      • Parts Classification (17)
      • Taxonomy (17)
      • Data Governance (16)
      • Data Migration (16)
      • Digital Thread (16)
      • Manufacturer Parts (16)
      • Product Data (16)
      • Aftermarket Parts (15)
      • Bulk Loading Data (15)
      • Data Classification (15)
      • ERP (15)
      • Business Integration (13)
      • Product Information Management (13)
      • B2B (12)
      • Part Cleansing (12)
      • Product Analytics (12)
      • Teamcenter (12)
      • New Part Introduction (11)
      • Part Standardization (11)
      • Data Integration (10)
      • Digital Commerce (10)
      • Service Parts (10)
      • Cost Reduction (9)
      • DFR PLM Integration (9)
      • Engineering (9)
      • Findability (9)
      • Repair Parts (9)
      • Spend Rationalization (9)
      • Benchmarking (8)
      • Digital Transformation (8)
      • Duplicate Analysis (8)
      • Part cost (8)
      • Supplier Management (8)
      • Aerospace (7)
      • B2C (7)
      • HVAC (7)
      • Sourcing (7)
      • Spend Analysis (7)
      • Analytics (6)
      • Data Management (6)
      • Data Onboarding (6)
      • Mergers & Acquisitions (6)
      • Part Rationalization (6)
      • Workflows (6)
      • classification structure (6)
      • DAM (5)
      • Data Factory (5)
      • Direct Materials (5)
      • Distributor (5)
      • Enrichment Lifecycles (5)
      • Product Structures (5)
      • Purchased Parts (5)
      • Supplier Rationalization (5)
      • categories (5)
      • Business Case (4)
      • Clusters (4)
      • Customer Experience (4)
      • Data Validation (4)
      • Digital Assets (4)
      • Electrical Parts (4)
      • Electronic Parts (4)
      • OEM (4)
      • PTC LiveWorx (4)
      • Part Preparation (4)
      • Procurement (4)
      • Product Attributes (4)
      • Searching (4)
      • Value Engineering (4)
      • Windchill (4)
      • Competitive Analysis (3)
      • Component Data (3)
      • DFRv10 (3)
      • Data Policies (3)
      • Integration (3)
      • Loading Data (3)
      • Match and Merge (3)
      • PIM 101 (3)
      • PIM Migration (3)
      • PTC (3)
      • PTC Windchill (3)
      • Regulatory Compliance (3)
      • Relationship data (3)
      • SiliconExpert (3)
      • Standard Parts (3)
      • supplier pricing (3)
      • 2019 Blogs (2)
      • A2L (2)
      • Acquisition Onboarding (2)
      • B2B2C (2)
      • D2C (2)
      • Data Mapping (2)
      • Design Parts (2)
      • HFCs (2)
      • Hybris (2)
      • Kalypso (2)
      • M&A (2)
      • Mechanical Parts (2)
      • Omnichannel (2)
      • PLM World (2)
      • ROI (2)
      • Refrigerants (2)
      • Sales Conversions (2)
      • reclassify (2)
      • smartclass (2)
      • suma (2)
      • 2016 Top Blogs (1)
      • 2021 blogs (1)
      • Aftermarket (1)
      • Arbortext (1)
      • Category Editing (1)
      • DFR University (1)
      • DFR v13 (1)
      • Dictionary (1)
      • EPA (1)
      • Finished Goods (1)
      • GWP (1)
      • IHS (1)
      • IoT (Internet of Things) (1)
      • LiveWorx 2023 (1)
      • Metadata (1)
      • Multi-Tier Data Model (1)
      • NPI (1)
      • National Oilwell Varco (1)
      • Part Approval (1)
      • Part Obsolescence (1)
      • Part Reclassification (1)
      • Partnership (1)
      • Pricing Data (1)
      • Purchasing (1)
      • SAP Hybris (1)
      • SCM (1)
      • Shape-Based Search (1)
      • Siemens (1)
      • Sustainability (1)
      • Syndication (1)
      • Teardown (1)
      • Vendor Portal (1)
      • WBR Research (1)
      • allowed values list (1)
      • attribute data (1)
      • cx (1)
      • data (1)
      • outsourcing (1)
      • prune (1)
      • units of measure (1)
      See all

      Recent Posts

      Stay in the know!

      con_logo
      Convergence Data's proprietary software and time-tested processes eliminate the clutter in your data—so you can use it to make sound business decisions.
      Our Solutions
      • Data Services
      • Competitive Intelligence
      • DFR
      • Image Services
      Who We Serve
      • Industries
      • Marketing
      • Engineering
      about
      • About Us
      • Partners
      Resources
      • Blog
      • Resource Center
      • Classification Community

      © 2025 , Convergence Data All Rights Reserved.